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UNIT 1 Getting started 


Study guide for Unit 1 


This unit reviews material that you will need as a basis for your study of 
MST209. You should have covered most of it in previous courses. 


The only non-text study for this unit involves the use of the computer algebra 
package for the course. All the computer activities appear in Section 7. 
These can be studied when you have finished each section. 


The time you will require to study this unit will depend on how familiar 
you are with the material that it contains. Most sections begin with a short 
diagnostic test. If you find that you can answer the test question(s) correctly, 
then it is probably safe to progress directly to the next section — you can 
always return to re-read a section if the need arises, and you can further 
check your knowledge by trying the other exercises. If you choose not to 
study a subsection in detail, do check for any new ideas introduced, and 
make sure that you look at those. (New terms are set in bold type.) 


Because this unit reviews a large amount of material, it is longer than other 
units in the course; so, even if the material is not new, you may find that 
there is a good deal to cover in one week. If you do not have time to 
study the whole unit, make sure that you are familiar with the material 
in Sections 2 and 3 and Subsections 5.1, 5.2 and 6.1, as this material is 
particularly important. You can always use this unit later to revise other 
topics when you find you need them. 


The unit is structured so that it is natural to study the material in the order 
in which it appears in the text. However, you can if you wish leave study of 
Section 4 until last. 


Unit 1 Getting started 


Introduction 


The main purpose of this unit is to review ideas that you should have met 
before, and that you will need as a basis for your study of MST209. The unit 
focuses mainly on mathematical techniques, but also covers some examples 
involving skills in the application of mathematics. The use of mathemat- 
ics to investigate questions arising in non-mathematical contexts is broadly 
referred to as ‘mathematical modelling’. In this course, the study of math- 
ematical techniques will quite often be separated from their use in models, 
as this enables you to practise the mathematical methods before you go on 
to use them. However, the methods introduced in the course are chosen 
because of their wide application in modelling. 


The unit contains a number of ‘standard formulae’: for example, for the 
solution of a quadratic equation, for expanding sin(a+ 6) and cos(a + b), 
and for the derivatives and integrals of standard functions. It is helpful if 
you are able to remember such formulae, but not essential; they are all given 
in the course Handbook. You do need to be aware that the formulae exist, 
however, and to be able to find them in the Handbook and to apply them. 


The computer algebra package for the course can be used to help with much 
of the work in this unit, and the unit reviews how this is done. However, the 
majority of the unit concentrates on how the mathematical techniques can 
be performed ‘by hand’; that is, without use of a computer (or calculator). 
This is because, in the long run, a familiarity and confidence with using 
common mathematical formulae and techniques by hand will speed up your 
study, and because it is not always convenient to resort to a computer. 
(Having said that, the computer remains a valuable tool, both for checking 
hand calculations and for addressing problems too complicated or time- 
consuming to be worked on by hand.) 


Section 1 starts by reviewing some basic points about numbers. Sections 2 
and 3 cover a number of important standard functions: linear, quadratic, 
logarithmic and exponential functions in Section 2, and trigonometric func- 
tions in Section 3. All these functions occur frequently. These sections also 
remind you of some important mathematical techniques: for example, for 
manipulating algebraic expressions, for solving a quadratic equation, and for 
manipulating expressions involving functions such as sin and cos. Section 4 
covers some basic ideas about complex numbers. 


Sections 5 and 6 discuss the fundamental concepts of calculus: differentiation 
and integration. It is important that you understand what these ideas are, 
and how they arise in models. These sections also provide plenty of exercises 
on performing basic calculus operations by hand, as being able to do this 
quickly will stand you in good stead for the rest of the course. 


The final section shows how the computer algebra package for the course 
may be used in many of the techniques in the unit. 


1 Numbers, measurement and accuracy 


This section covers some fundamental terminology and notation relating to 
numbers. 


Generally, we advise you to 
first attempt all exercises 
without using a computer, 
unless they are specifically 
marked as computer 
activities. 


Note, however, that the 
computer algebra package for 
the course can be used to find 
most derivatives and many 
integrals. 


Section 1 Numbers, measurement and accuracy 


Diagnostic test 1.1 


Do Exercise 1.1 on page 9, and check your answers with the solutions given 
on page 53. If you are happy with your answers, you may proceed directly 
to Section 2. 


We distinguish various types of number. The integers are the positive and 
negative whole numbers, together with zero: 


ee ee ee me ee es 


We denote the set of all integers by Z. These are fine for counting, but 
insufficient for measuring lengths, for example. Non-integer quantities can 
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sometimes be represented exactly, for example as fractions (such as 3) or 


roots (such as V5). Often in this course, however, decimals will be used. In 
decimal notation, we can sometimes express a number exactly, but frequently 
we need to approximate. We may write a number correct to a particular 
number of decimal places, as in, for example, 

4 = 2.33 (to two decimal places), 


V2 = 1.4142 (to four decimal places). 


For large or small numbers we extend decimal notation by ‘taking out’ pow- 
ers of 10, as in, for example, 


3.414 x 10° (for 3414000), 
3.42 x 107" (for 0.000 000 342). 


In this course we shall use the convention that a non-zero number is ex- 
pressed in scientific notation as 


+b x 10", 


where 1 < 6 < 10 and c is an integer. 


If 3.414 is quoted to an accuracy of three decimal places, then 
3.414 x 10® = 3414000 almost certainly is not. It is, however, accurate to 
four significant figures. That is, giving a number to ‘so many decimal 
places’ indicates its absolute level of accuracy, while to ‘so many significant 
figures’ indicates the accuracy relative to the size of the number itself. 


Many numbers cannot be represented exactly by a decimal, but any such 
number can be approximated arbitrarily closely by a decimal. For example, 
taking more and more decimal places, the number 7 is 


3.1, 3.14, 3.142, 3.1416, 3.14159, 3.141593, 3.1415927, .... 


Any number that can be approximated arbitrarily closely by a decimal (or 
is actually equal to one) is called a real number. We denote the set of all 
real numbers by R. Those real numbers that can be expressed exactly as a 
fraction (such as 75) are referred to as rational numbers, and those that 
are not equal to any such fraction (such as 7 and V2) are called irrational. 


Sometimes one wishes to reduce the accuracy to which a number is given. 
For example, a calculator may give you the result of some calculation to 
an accuracy of ten significant figures, while the assumptions on which the 
calculation was based justify quoting the result only to three significant 
figures. 


Other conventions for 
scientific notation exist; for 
example, choosing b to satisfy 
01<b<1. 


Rationals have either a 
terminating or a recurring 
decimal representation; 
irrationals do not. For 
example, 3 has the 
terminating decimal 
representation 1.5, whereas 4 
has the recurring decimal 
representation 1.333 333.... 
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The process of reducing the number of significant figures or decimal places 
to which a number is expressed is referred to as rounding. To express 
3.14159 to three decimal places, we simply take the number expressed to 
three decimal places that is closest to 3.14159, which is 3.142. It is usual to 
make clear that a number is accurate only to some specified level by writing, 
for example, ‘3.142 (to three decimal places)’. The process of rounding 
is straightforward except in one case. The number 4.15, for example, is 
equally close to 4.1 and 4.2. To round 4.15 to one decimal place we shall 
use a standard, if arbitrary, convention, in which 5 is always rounded up for 
positive numbers. So 4.15 is expressed as 4.2 (to one decimal place). For 
negative numbers, 5 is rounded down: for example, —4.15 is expressed as 
—4.2 (to one decimal place). 


The choice of how accurately to express some real number depends on cir- 
cumstances. If the number represents a measurement, you may know how 
accurately the measurement was made. For example, using a tape measure 
(see Figure 1.1) to measure a length carefully, one might hope to measure 
it to the nearest millimetre, and would express the measurement as, for 
example, 1.274 metres (to three decimal places). 


Figure 1.1 A tape measure 


If we know that a real number z is 1.274 to three decimal places, then x lies 
between 1.2735 and 1.2745; that is, x lies in the interval 


(1.2735, 1.2745]. 


This ‘closed interval’ notation represents the set of real numbers between 
1.2735 and 1.2745, inclusive of the endpoints. We can actually say slightly 
more, for we know that x is not 1.2745, since that would round up to 1.275. 
So x is in fact in the interval 


[1.2735, 1.2745), 


which represents the set of real numbers between 1.2735 and 1.2745, with 
1.2735 included, but 1.2745 not included. Other, similar, ‘round bracket’ 
notations are also used. 


(a, b] means all real numbers x between a and b, 
with a and 6 both included. 


(a,b) means all real numbers x between a and b, 
with a included and 6 excluded. 


(a, b] means all real numbers x between a and 6, 
with a excluded and 6b included. 


(a,b) means all real numbers x between a and 8, 
with a and b both excluded. 


The interval [a, b] is sometimes referred to as a closed interval, the interval 
(a,b) as an open interval, and the intervals [a,b) and (a,b] as half-open 
intervals. As you will see later, interval notation is useful in expressing 
domains of functions and in discussing the accuracy of calculations, and 
sometimes it is important to be able to say whether or not the endpoints 
are to be included. 


It is common scientific practice to quote measurements with ‘error bounds’. 


When considering problems 
arising from the real world, 
we shall generally quote 
numbers to relatively few 
significant figures. When 
discussing numerical 
methods, a larger number of 
significant figures will be 
used. The number of 
significant figures used will 
depend on the context. 


Note the round bracket on 
the right. 


that is,a<a<b 


that is,a<a<b 


that is, a<a<b 


that is,a<a<b 


We might write, for example, 


y = 32.62 + 0.08m 


to indicate that the observed measurement of y is 32.62m, but this can be 
expected to be accurate only to within an error bound of 0.08 m; that is, 
the actual value of y may lie anywhere between 32.62 — 0.08 = 32.54m and 
32.62 + 0.08 = 32.70 m, inclusive (i.e. anywhere in the interval (32.54, 32.70]). 
Such error bounds may not fit exactly with expressing y to a particular 
number of decimal places or significant figures (see Exercise 1.1(b)). We can 
also express the information that y is within 0.08m of 32.62m by writing 


—0.08 < y — 32.62 < 0.08, 
or more succinctly by writing 
|y — 32.62] < 0.08. 


Remember, |x| is a non-negative number with the same magnitude as z. So, 
for example, |5.72| = 5.72, while |—3.8907| = 3.8907. 


“Exercise 1.1 
(a) Express the following numbers in scientific notation. 
(i) 64823.5 (ii) 0.000 073 


(b) Suppose that we know that a measured value y is 


y = 127.683 + 0.006 m. 


(i) Give an interval in which the number y must lie. 
(ii) To how many significant figures can we give y with certainty? 


(c) Suppose that the number zw satisfies the condition 
|2 — 2.763] <5 x 107%. 


What is the smallest interval in which this condition tells you that x 
must lie? To how many decimal places can we give x with certainty? 


2 Some standard functions 


Functions play a central role in mathematics. After a brief look at some 
general ideas about functions (in Subsection 2.1), this section reviews some 
key classes of functions (linear, quadratic and exponential). These form part 
of a ‘library’ of standard functions, central both to building models and 
to solving more complicated mathematical problems (such as differential 
equations). The section also looks briefly at how such functions may be 
combined. 


2.1 Functions, variables and parameters 


Diagnostic test 2.1 


If you are familiar with each of the following terms (and the distinction 
between them), you may proceed directly to Subsection 2.2. If not, it is 
advisable to read this subsection. 


(a) Continuous model, Discrete model. 
(b) Variable, Parameter. 
(c) Domain, Image set. 


Section 2 Some standard functions 


We refer to |x| (read as 
‘mod «’) as the modulus 
(or magnitude or absolute 
value) of z. 


Note that in this course we 
use the convention of placing 
a * against not-to-be-missed 
exercises. 
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Consider the following example. At midday on 1 June, a reservoir contains 
2 x 10° cubic metres of water. Each day, 25000 cubic metres of water are 
removed from the reservoir, while only 3400 cubic metres flow into the reser- 
voir, and it is expected that these conditions will continue for 50 days. If 
there are no other factors affecting the quantity of water in the reservoir, 
then there is a daily net reduction of 21600 cubic metres of water. Assum- 
ing that the rate of reduction is exactly the same at all times, this means 
that the quantity of water in the reservoir reduces by 21 600/(24 x 60) = 15 
cubic metres each minute. Suppose that, at a time ¢ minutes after midday 
on 1 June, the reservoir contains V cubic metres of water. Then we might 
use the equation 


V=2x10°—-15t (t€R, 0<t < 72000) (2.1) 


to model the quantity of water in the reservoir for the 50 days (= 72000 
minutes) after midday on 1 June. The letters V and t represent measurable 
quantities. We call V and t variables. Here, V depends on t, and we call 
V the dependent variable and ¢ the independent variable. 


A different way to approach the same problem is to form a recurrence 
system as follows. Denote the volume of water in the reservoir at the end 
of minute i as V;. The initial volume of water is 2 x 10° cubic metres, and 
we denote this as Vo. Then (using the above arithmetic) we see that 


Vo=2x10°, Vianu=Vj—-15 (i =0,1,2,...,72000). (2.2) 


You may have met such recurrence systems before, and recognize that (2.1) 
is the closed form solution. 


Equations (2.2) relate two variables: volume (V;) and time (7). However, in 
(2.2) the independent variable 7 is constrained to take only integer values 
(between 0 and 72000), while in (2.1) the independent variable t may take 
any real value (between 0 and 72000). We call (2.1) a continuous model, 
while (2.2) is a discrete model. 


A function is a process that can be applied to each of a specified set of input 
values to produce an output value. One example is ‘given t between 0 and 
72000, calculate 2 x 10° — 15t’. If we denote this function by f, then we can 
write Equation (2.1) as V = f(t). The domain of a function is the set of 
permitted input values. The function f associated with Equation (2.1) has 
as domain the set of real numbers t with 0 < t < 72000; that is, the interval 
(0, 72000]. If we were to associate Equations (2.2) with a function g, say, 
then g would have as domain the set consisting of those integers 7 between 
0 and 72000. The image set of a function is the set of output values. The 
function f associated with Equation (2.1) has as image set the set of values 
of 2 x 10° — 15t for 0 < t < 72000; that is, the interval [920 000, 2.000 000}. 


In this course we shall usually be concerned with continuous models, and 
with functions whose domain is R, or a part of R such as an interval. We 
may specify such a function, say f, by writing it in a form such as 


f@)=2* 10° =15t (0 <¢ < 72000), 


where the expression 2 x 10° — 15t on the right-hand side gives the rule or 
formula that specifies the function, and the bracketed conditions indicate 
its domain. 


To define a function, a process must produce a unique output value for each 
allowed input. 
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Since in MST209 we are 
almost always concerned with 
continuous models, we shall 
usually omit ‘t € R’ from the 
bracketed conditions. 


So, for example, 
f(a) =4VE (x20) 


does not define a function f, since +/x does not specify a unique value 
(given x). 


Now consider a generalization of the situation described by Equation (2.1). 
Suppose that the reservoir initially contains Vo cubic metres of water and 
that the net loss per minute is L cubic metres. Then we have 


V=V-Lt (0<t< 72000). (2.3) 


We now have an equation involving several letters, with differing roles. As- 
suming that we want to use (2.3) to describe how V will change with time t, 
we continue to call t the independent variable and V the dependent vari- 
able. The quantities Vo and L do not depend on t. They may, however, take 
different values in different uses of (2.3) — in an application to a different 
reservoir, for example. We call Vo and L parameters. Whatever the values 
of the parameters Vo and L, Equation (2.3) gives a similar form of relation- 
ship between V and t: for example, the independent variable t appears in a 
similar way in any of the expressions 12000 — 5t, 300 — 6.6¢ and 14 — 2t. 


In this course we shall often be concerned with relationships between vari- 
ables, and it will be convenient to use language and notation that blur the 
abstract idea of a function. For example, we may write V = V(t), rather 
than introducing a separate symbol (such as f) for the function relating V 
to t, and say ‘V is a function of t’. Then, for example, V(3) denotes the 
value of V when t = 3. 


Exercise 2.1 


Suppose that, at midday on 1 May, a reservoir is at 60% of its capacity. The 
forecast for the next 50 days suggests that 30 000 m?° of water will be removed 
each day for consumption, while only 15 600 m? will be added to the reservoir 
each day, so that on average the quantity of water in the reservoir reduces by 
(30 000 — 15600) /(24 x 60) = 10 cubic metres each minute. Crisis measures 
will be introduced when the reservoir falls to 20% of its capacity. 


Let V (measured in m°) be the volume of water ¢ minutes after midday on 
1 May, and let C (measured in m?) be the reservoir’s overall capacity. The 
volume of water can be modelled using 


V=VW-10t (0<t< 72000), 
with a suitable choice of Vo. 


Determine a suitable expression for Vo, and hence use the model to obtain 
an expression in terms of C’ for the time at which crisis measures will be 
needed (according to this model). 


2.2 Linear functions 


Diagnostic test 2.2 


Read the description of the problem below, and do Exercise 2.2. Then do 
Exercise 2.3 on page 13, and check your answers with the solutions given on 
page 53. If you are happy with your answers, you may proceed directly to 
Subsection 2.3. 


Section 2 Some standard functions 


We write +,/z to denote the 
positive and negative square 
roots of « because, by 
convention, \/x denotes only 
the positive square root. 


The notation m? is shorthand 
for ‘cubic metres’. 
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Unit 1 Getting started 


A linear function relating y to x is one of the general form 
yY=Mr+C, 


where m and c are constants. The graph of such a function is a straight line 
(hence the term ‘linear’), as in Figure 2.1. The constant c represents the 
value of y at the point where the line crosses the y-axis. The gradient (or 
slope) of the graph is the same everywhere, and is equal to m. That is, for 
any two points (21, y1) and (22, y2) on the graph, we have 


y2— VY 


TQ — LY 


(X, Yo) 


: Vy Nh 
radient m = tan gq = ——— 
g g ae 


Figure 2.1 The graph of y= ma+c 


One situation where linear functions arise is when an object is moving in a 
straight line with constant speed. Let us look at an example. 


A boat, suspected of carrying contraband, passed a detector buoy 2 kilome- 
tres from port at 11.00 pm, and is moving at a steady 5 metres per second on 
a straight course directly away from port (along the line AZ in Figure 2.2). 
A coastguard cutter leaves the port in pursuit at midnight, travelling at 
7 metres per second. When will it catch the boat? 


Figure 2.2 


Suppose that we choose to measure time in seconds, starting from midnight, 
and distance in metres, measured from A. Let X metres be the distance 
of the coastguard cutter from A at time t seconds after midnight, and let 
Y metres be the distance of the boat from A at the same time. We can 
readily obtain an expression for X in terms of t, since X = 0 when ¢ = 0, 
and the cutter travels at a constant speed of 7 metres per second: we have 


X = Tt. 


We also want an expression giving Y in terms of t. We know that (at 
point B) Y = 2000 at 11.00pm, which is 1 hour, or 60? seconds, before 
midnight and so corresponds to t = —3600. Also, as the boat is moving at 
a constant speed of 5 metres per second, Y will be related to t by a linear 
function of the form 


Y=5t+ec. 
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In this course, if no domain is 
specified for a function, 
assume it to be R. 


The notation ms~! is 


shorthand for ‘metres per 
second’. 


In this course, we shall 
usually use SI units. The SI 
units of distance and time are 
metres and seconds (denoted 
by m and s, respectively). 
Commonly used SI units are 
given in the Handbook. 


*Exercise 2.2 


(a) Find the value of c such that Y = 5t +c satisfies the condition Y = 2000 
at t = —3600. 


(b) (i) When will the coastguard cutter catch the boat? 


(ii) In the direction that the boat is travelling, the limit of territorial 
waters is 100 kilometres from A. Will the cutter catch the boat within 
territorial waters? 


Simultaneous linear equations 


Suppose that we know the paths of two aircraft, each of which is travelling 
in a straight line, and wish to know where these paths cross. This is one 
of a wide variety of situations where we need to find the intersection of two 
straight-line graphs, which is equivalent to the algebraic problem of solving 
two simultaneous linear equations. Consider, for example, the following 
linear equations (see Figure 2.3): 

Ag + 3y = —1, (2.4) 

3x +y = 3. (2.5) 
There are many ways of solving these equations: one quick method is 
Gaussian elimination. Let us see how this works for (2.4) and (2.5). 


The aim of the method is to subtract a multiple of the first equation from 
the second in order to eliminate the x terms. First, we multiply (2.4) by 3 
to obtain an equation with the same coefficient of x as in (2.5): 


3x 4dr +3 x 3y = 3(-1), 
which simplifies to 

3a + fy = —#. (2.6) 
2.6) from (2.5), to eliminate x, and obtain 


OA 3) 15 
7 ed oe ar 


Now we subtract 


gaa- FS 


y-qy=3 
that is, —3y = ». so y= —3. 
To find x, substitute this value of y into (2.4), to obtain 
4x + 3(—3) = -1, 
which gives 4a = —1+9= 8, and hence x = 2. 
So the solution of Equations (2.4) and (2.5) is = 2, y = —3. 


*Exercise 2.3 


Use Gaussian elimination to solve the equations below for u and v: 
2u — 5u = 19, 
3u + 4v = —29. 


2.3 Quadratic functions 
Diagnostic test 2.3 


Section 2 Some standard functions 


Figure 2.3 


These equations are linear, 
since they can be rewritten in 
the form y = ma +c. 


This method was known to 
Chinese mathematicians in 
about 100 Bc, but not to 
European ones until it was 
discovered by the German 
mathematician Carl Friedrich 
Gauss (1777-1855). (‘Gauss’ 
is pronounced as ‘gowce’.) 


You may like to check these 
values, by substitution 
into (2.4) and (2.5). 


Do Exercise 2.4 on page 15, and check your answers with the solutions on 
page 53. If you are happy with your answers, you may proceed directly to 


Subsection 2.4. 
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Unit 1 Getting started 


A quadratic function relating y to x is a function of the general form 
y = az’ +bat+c, 


where a, b and c are constants and a #0. The graph of such a quadratic 
function is a parabola, and may open ‘up’ or ‘down’, depending on the sign 
of a, as illustrated in Figure 2.4. 


av 


ay 


(a) y=2x° +3x-4 (b) y=? + 4x 415 
Figure 2.4 


Where the graph opens ‘up’ (a > 0), values of y may become arbitrarily 
large, but there is a smallest (minimum) value that y can take. If a < 0, then 
the graph opens ‘down’, and negative values of y may become arbitrarily 
large in magnitude, but there is a largest (maximum) value that y can take. 


For example, suppose that a ball is thrown directly upwards at time t = 0, 
with velocity 10ms~', and from a height of 2 metres (see Figure 2.5). 


The ball moves under the influence of gravity, and the position s after t sec- 
onds is given by 


s = 4(—9.81)t? + 10¢ + 2. 


Suppose that we want to find when the ball will hit the ground; that is, the 
value of t when s = 0. Then we need to solve the quadratic equation 


5(—9.81)t? + 10t +2 =0. (2.7) 


You will have met before the formula for the solution of a general quadratic 
equation, given below. 


Solution of a quadratic equation 
The quadratic equation 
ax? + br +c=0, 


where a, 6 and c are constants and a ¥ 0, can be solved for x using the 
formula 


—b+ Vv b2 — 4dac 
2a ; 
The solutions of a quadratic equation are often referred to as its roots. 


(2.8) 


jt — 


Notice that the sum of the roots is —b/a, which is a useful check. 


Using the formula to solve (2.7) for t gives 


—10+ 100 + 39.24 
i= 9 —- = 2.22 or —0.18 (to two decimal places). 
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SA 


g=9.8l ms” 


he =10ms"! 


2m 


Figure 2.5 


This equation will be derived 
in Unit 6. 


The term ‘root’ is also used 
for a solution to other sorts of 
equation, as you will see in 
Section 4. 


Here the solution t = —0.18 refers to a time before the ball is thrown, so can 
be discarded. The ball hits the ground about 2.2 seconds after it is thrown. 


In this example, the quadratic equation has two solutions. Look at the 
graphs in Figure 2.4, and imagine moving them up and down (which corre- 
sponds to varying the value of c). The x-axis may meet a quadratic graph 
in two places, or not at all, or it may happen just to touch the minimum 
(or maximum) point of the graph (see Figure 2.6). In formula (2.8), we 
need to find the square root /b? — 4ac. If b? — 4ac > 0, then we find a 
real value, greater than 0, for this square root, and there are two different 
solutions to the quadratic equation. If b? — 4ac = 0, then there is just one 
solution (though, for reasons given below, this one solution is sometimes 
considered as two equal solutions). If b? — 4ac < 0, then there are no (real) 
solutions. The quantity b? — 4dac is often referred to as the discriminant of 
the quadratic equation because it discriminates between the cases shown in 
Figure 2.6. 


YA YA 


Section 2 Some standard functions 


Complex numbers, which are 
discussed in Section 4, enable 
us to express square roots of 
negative numbers and hence 
to produce (complex) 
solutions to a quadratic 
equation when b? — 4ac < 0. 


YA 


b?-4ac>0 b?- 4ac =0 


Figure 2.6 


*Exercise 2.4 


Solve for x the following equations. 
(a) 227+ 72 -—4=0 (b) «7 +2-6=0 


Sometimes, you may find that you need to solve a quadratic equation where 
the coefficients are letters rather than numbers. 


Exercise 2.5 
Show that the solutions (for x) of 


ma? +2ka+mw*=0 (m0) 
are x = —K + VK? —w?, where K = k/m. 


The solutions of a quadratic equation correspond to a factorization of the 
corresponding quadratic function. For example, x? + 2 — 6 = 0 has solutions 
x = 2 and x = —3, and we have the factorization 


a? +2 —6= (a —2)(x+3). 
With experience, you may find that such factorizations provide a convenient 


way of solving some quadratic equations, but the formula provides a reliable 
method that can be used in all cases. 


b?-4ac <0 


You may like to check this by 
multiplying out 
(a — 2) x (x + 3). 
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Unit 1 Getting started 


One point of caution: if you want to factorize a quadratic function, you can 
do this by first solving the equation (e.g. by using formula (2.8)), but you 
need to be careful to match the coefficient of x? in the original quadratic 
function with that in the factorization. For example, 2x? + 7x — 4 = 0 has 
solutions 7 = 5 and « = —4, but to factorize 2%? + 7x — 4 we write 


Qn? + 7x —4 = 2a — §)(a +4) = (22 -1)(x +4), 
where the 2 is needed to ensure that the coefficients of x? are the same on 
each side. 


There are some particular factorizations that it is helpful to recognize. Two 
useful ones are 
(n+ Al? =a74+2Ar+ A2 and (#2— A)? =22-2Ar4 A’. 


So, for example, 2? — 62 +9 = (x —3)?. We refer to such quadratics as 
perfect squares. Perfect squares correspond to quadratic equations in 
which the discriminant is b? — 4ac = 0. (You may like to check this for 
yourself.) Thus equations in which the discriminant is zero can be written in 
the form (a + A)(x+ A) = 0 or (a — A)(a — A) = 0, and these factorizations 
lead us sometimes to consider such equations as having two equal roots 
x =—Aand « = —A, or x= A and x = A, rather just one root. 


Another useful factorization is 
(2 + A)(a — A) = a? — A?. 


So, for example, x? — 16 = (x + 4)(x — 4). We refer to such a quadratic as 
a difference of two squares. 


One needs to be particularly careful when solving a quadratic equation that 
involves the same letters as appear in the standard formula (2.8), but in a 
different way. 
Example 2.1 
Solve for x the equation 

abx? — (a+b)r+1=0, 


where a and b are non-zero constants. 


Solution 
You need to keep a cool head here, because the letters used in formula (2.8) 
are used in a different way in the given equation. In (2.8), we need 
ab for a, —(a+b) for b, 1 tor:¢, 
So we obtain the solutions 
at+b+./(a+b)? —4ab 
2ab ; 
This expression gives the solutions, but it turns out to be possible to express 


them in a much simpler form. We have (a+b)? = a? + 2ab +7, so the 
discriminant can be written as 


(a+b)? — 4ab = (a? + 2ab + b*) — 4ab = a? — 2ab+ b* = (a —b)’. 


Therefore 
a+b+./(a+b)? — 4ab atb+ (a — b)? a+b+(a-—b) 
2ab ~ 2ab 2ab 
Now (a+ 6) + (a — b) = 2a and (a+ b) — (a — b) = 2b, so the two solutions 
are 1l/aand1/b. & 
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Note that if we allow A to be 
positive or negative, then 
both cases can be written as 


(2 + A)? = 27 +2Az+ A’. 


You will find it advantageous 
in this course to be able to 
perform manipulations like 
this by hand. If you find 
them difficult, however, you 
may like to make use of the 
computer algebra package for 
the course. 


2.4 Exponential and logarithm functions 


Diagnostic test 2.4 


Do Exercises 2.6 and 2.7 below, and check your answers with the solutions 
starting on page 53. If you are happy with your answers, you may proceed 
directly to Subsection 2.5. 


A function relating y to x of the form y = ba® (where a and 0 are constants, 
with a > 0 and a #1) is referred to as an exponential function. Use 
of such a function with domain R requires us to assign a meaning to a” 
for non-integer values of x. We start by revising the properties of integer 
powers. 


Powers 


You will be familiar with the meaning of a positive integer power of a num- 
ber, such as 10° = 10 x 10 x 10 x 10 x 10. In general, a” means the product 
of n copies of a (for any real number a and any positive integer n). In 
particular, a! = a. 


For positive integers m and n, we have the property 


nag". (2.9) 
since each side is the product of n+ m copies of a. For example, 

10? 10° =10", 
Consequently, if we multiply m copies of a”, we obtain 

m times 
————-— 
oR ee ee Sa 
m times 

that is, 

(a")™ =a™™™, (2.10) 


For example, (107)? = 10°. 


The definition of a” can be extended to cases where n is not a positive 
integer by assuming that (2.9) and (2.10) hold more generally. For a 4 0, 
this assumption leads to the definition of a° as 1, and a~” as 1/a”; and, for 
a > 0, to the definition of a!/”" as the nth root of a, and a™/” as the nth 
root of a”. So, for example: 


10-* = 1/10* = 0.0001; 
27/3 = 427 =3 (since 3° = 27); 

1 1 1 1 
43/2 7B /64 8° 
It is conventional to take fractional powers to mean positive roots (where 
there is a choice). So, for example, 9/2 — \/9 means 3 rather than —3. 
In general, roots of negative numbers do not necessarily exist (at least, 
not as real numbers); but where they do, we use the same notation. So, 
for example, (—27)!/3 = ¥/—27 = —3 (since (—3)? = —27, and there is no 
positive cube root in this case). 


4-3/2 


Section 2 Some standard functions 


In a”, a is called the base, 
and n may be referred to as 
the power, the index or the 
exponent. 


Recall that the nth root of a 
number a is a number Db such 
that b” = a, and we write 


b= Ya. 


The negative square root of 5, 
for example, would be written 
as —5//2 or —V5. 
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Unit 1 Getting started 


We can define a® for a > 0 and for irrational values of « by means of a 
limiting process that need not concern us here. (The value of a® for any 
particular a > 0 and x can be found using your calculator.) This definition 
of a” leads to the following properties of powers that hold for all real numbers 
a > 0 and all real exponents x and y: 

a” >0, 

a” =a”, 

at t¥ = aq? x a’, 

(a*)Y =a, 

a fae =a", 
Finally, note that for powers of a product or a quotient, we have 

(aby =e" (ajby =a 7b". 
For example, 15’ = 35’ and (5/3)4 = 54/34. 


and 


*Exercise 2.6 


Use the properties of indices to simplify each of the following. 
(a) a’a? —(b) a®/a?_—(c) (a)? (d) (277)? x 48 
(ec) 8/8 (f) 16/4 (g) (4)°? (a) (164) 17? 


The exponential and logarithm functions 


One function that is particularly important is e”, where e is the number 
2.71828... . This function arises, for example, in the solution of differential 
equations. This is a consequence of its property that it is unchanged by 
differentiation. (Indeed, the only functions f that are unchanged by differ- 
entiation are functions of the form f(x) = Ae*, where A is a constant.) The 
function e*” is often referred to as the exponential function, and may also 
be written exp. Note that e” is always positive: e” > 0 for all real x. 


The inverse function of e” is the natural logarithm function, written Ina. 
Now In is defined only for x > 0 (since the domain of the inverse function 
of e” is the same as the image set of e”). Since these functions are inverse 
to each other, we have: 

In(expx) = for all real z; 

exp(Inz) =a for all real x > 0. 


Another way of looking at this relationship is: if e? = x, then y = Inz. In 
particular, since e? = 1, we have In1 = 0. 


The properties of powers given above lead to corresponding results about 
the logarithm function: 

In(1/u) = —Inu, 

In(u x v) =Inu+Inv, 

In(u’) =v x Inu, 

In(u/v) =Inu—Inv. 
Before the advent of calculators and computers, these properties were com- 
monly used in calculating powers, reciprocals and products ‘using loga- 
rithms’. Such applications are no longer needed, but these properties of 


logarithms are still important in the manipulation of expressions involving 
exponentials and logarithms. 
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For a = 0, 0* is taken to 
equal 0. For a < 0, the 
definition of a” involves 
complex numbers and need 
not concern us here. 


We have not proved these 
properties, but we shall make 
use of them as necessary. 


Differential equations occur 
throughout the course, 
beginning in Unit 2. 
Differentiation is discussed in 
Section 5. 


Essentially, the inverse 
function of a function f is 
one that reverses the effect 
of f. 


This can be taken as a 
definition of Ina. 


Another logarithm function 
that will be used occasionally 
in the course is log), where 
if 10% = a, then y = logy, 2. 
The results given here for In 
also hold for logy . 


Section 2 


*Exercise 2.7 

Simplify each of the following (where a > 0,b >0 and a > 0). 

(a) n7+n4-Inl4 (b) Ina+ 2Inb-—In(a’b) 

(c) e* x (e¥)? + e* (d) In(e® x e¥) (e) es ().e-7 


(g) exp(2Inz + In(z + 1)) 


Exercise 2.8 


By taking logs of both sides of the equation 


where a > 0, show that we can find a value for k so that this equation holds 
for all values of «x. 


In light of the equivalence established in Exercise 2.8, it is common practice 
to use functions of the form e*”, for suitable values of k, rather than expo- 
nential functions of the form a® for values of a (> 0) other than e. (This is 
more convenient when doing calculus, for example.) For k 4 0, the graph 
of this standard exponential function takes one of the forms shown in Fig- 
ure 2.7. For k > 0, the larger the value of k, the faster the value of e** 
increases as x increases (and so the graph climbs more steeply). Similarly, 
for k <0, the larger the magnitude of k, the faster e** decreases. 


YA yA 


Figure 2.7 Graphs of y = e** 


Log plots 


Suppose that you have data on some quantity y at various times t, and you 
believe that y is an exponential function of t. To test such a hypothesis, you 


Some standard functions 


can plot Iny against t. If such a plot gives a straight line, then this confirms Such plots are often referred 
that y is of the form Ae*’. For example, suppose that a plot of Iny against to as log—linear plots. 


t suggests the linear relationship 
Iny = 1.47t + 3.82. 
Then, taking exponentials of each side, we have 
y = exp(1.47t + 3.82) 
= exp(1.47t) x exp(3.82) 
=Ao.6e™, 


The next exercise shows how we can test data (on variables x and y) for a 
different form of relationship by plotting In x against In y. 
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Unit 1 Getting started 


Exercise 2.9 
(a) Suppose that 


Iny = 2.83 ln a + 0.87. 
Using the properties of exp and In, express y as a function of z. 
(b) In general, suppose that 
Iny=alnx+6, 


where a and 6 are constants. What form of relationship is there between 
y and x? 


From Exercise 2.9(b), we see that if a plot of Iny against In z is linear, then 
y is a power function of x; that is, 


= a 
YrCcr , 


where c and a are constants. Note that this is not an exponential function. 
Power functions of 2 include 


a, 2°, 2°/? (= 2*/2). 


If a is a positive integer, then the power function f(x) = x° is defined for 
all real 2; but for other values of a, this power function has domain x > 0. 


2.5 Combining functions 


Diagnostic test 2.5 
(a) If f(z) =Inz and g(x) = 1/(a — 1)”, and x > 1, find the following. 
(i) F(g(@)) (ii) 9 F(@) 
(b) Express h(a) = (1 + e”)? as a composition of basic functions. 
Now check your solutions against those given below. If you are happy with 


the answers, you may proceed directly to Section 3. 


Solution 
(a) (i) (g(x) = nA /(# — 1)*) = —2In(@ — 1) 
(ii) g(f(x)) = 1/(Ina — 1)? (This has no easy simplification.) 
(b) We can obtain h(x) in three steps. 
Step 1 Start with e*. 
Step 2. Add 1 to the result of Step 1. 
Step 8 Apply the square function to the result of Step 2. 
Then h(x) = p(q(r(x))), where 


p(x) =27, g(a) =2+1, r(x) =e?. 


Consider the motion of a stone that is thrown vertically upwards with a 
velocity of 10ms~! from a point 2m above the ground (see Figure 2.5 on 
page 14). We shall see in Unit 6 that the height s of the stone in terms of 
time t may be modelled by the equation 


s=—ggt? + 10¢ +2, (2.11) 
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Plots of Iny against In « are 
called log—log plots. 


In an exponential function, x 
appears as the exponent, as 
in for example 

2. 3", 2:5", 
each of which can be 
expressed as e** for a suitable 
value of k. 


and that its velocity is given by 

v=10—gt, (2.12) 
where g = 9.81 ms~? is the acceleration due to gravity. 
Now suppose that we wish to know the velocity as a function of height 
(v = v(s)). By manipulating the two equations above, eliminating t, you 
can see that 

v? = 4g + 100 — 2gs, 


of which one solution is 


v = V/4g + 100 — 29s. (2.13) 
However, there is now potential for confusion about what is meant by v. In 
Equation (2.12), v = v(t) is a function of time t; in Equation (2.13), v = v(s) 
is a function of height s. 

Unfortunately, this notation suggests that we are using the same ‘name’, v, 
for two different functions. (The expressions on the right in (2.12) and (2.13) 
are different!) In such a context, it may be necessary, for clarity, to introduce 


different names for these functions. In fact, we could resolve the situation 
by solving Equation (2.11) for t, giving 


10 — /100 + 4g — 2 
b= ag ee f(s), say. (2.14) 
g 


Now Equation (2.12) is of the form v = g(t) (where g(t) = 10— gt). Then 
Equation (2.13), which shows the dependency of v on s, can be written as 
v= 9(f(s)). 


In general, the function h with the rule 


h(x) = g(f(@)) 


is called a composite function; it is the composition of the functions g 


and f. 


When combining functions in this way, it is important to check how the 
domains affect each other. In the above example, (2.14) is valid only when 
100 + 4g — 2gs > 0, and we can see from (2.13) that the same condition 
must hold in order to find the composite function v = h(s). 


You will see later that when doing calculus it is useful to be able to form 
compositions of functions, and even more useful to be able to recognize a 
complicated function as the composition of simpler ones. 


Example 2.2 

If f(x) =e” (x ER) and g(x) =1+ 2? (x € R), what are the following? 
(a) g(f(w)) — (b) F(g(@)) 

Solution 

(a) g(f(x)) = g(e*) =1+ (e*)? =1+e” 

(b) f(g(x)) = f(1+ 2?) = 

Notice that g(f(x)) and f(g(x)) are different. The function g(f(x)) is ‘apply 
f first, then g’, while f(g(x)) is ‘apply g first, then f’; the order in which f 
and g are applied matters! 


Section 2 Some standard functions 


Here we take the positive 
square root, which 
corresponds to the upward 
motion of the stone. Unit 6 
will deal with both cases. 


Again we must be careful 
about the sign of the square 
root, so that the equation 
corresponds to the upward 
motion of the stone. 


In this example, both 
functions f and g are defined 
for all x in R, so we need not 
worry about domains. 
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Example 2.3 
Express the function 
1 
e) = ———__- (2.15) 


(VIF Re)" 


as a composite of a quadratic function and a power function. 


Solution 
Note first that 1+ 22? is a quadratic function and that, writing y = 1+ 227, 


1 
the right-hand side of (2.15) becomes a) 
we can obtain h(x) in two steps. v¥ 


z= y— 3/2 (a power function). So 


Step 1 Calculate y = 1+ 22”. 
1 
Step 2. Apply ——~ = y~?/” to the result of Step 1. 


(vy)? 
So if f(x) = 1+ 22? and g(x) = x~ 3/2, then h(x) = g(f(x)). Ml 


Exercise 2.10 

(a) If f(x) =e-* and g(x) = 1 — 2°, find the following. 
(i) Flo) — Gi) 9 F@) 

(b) Express 
1 

(4+ 907)4 


as a composite of a quadratic function and a power function. 


Alt) = 


3 Trigonometric functions 


In this section, we add another class of functions to the ‘library’ developed 
in Section 2. These are the trigonometric functions. They originate in the 
geometry of right-angled triangles, but in this course we are equally often 
concerned with their use in modelling repetitive or oscillatory behaviour. In 
particular, they arise as solutions of certain differential equations. 


3.1 Introducing the trigonometric functions 


Diagnostic test 3.1 


Do Exercise 3.1 on page 23, and check your answers with the solutions given 
on page 54. If you are happy with your answers, you may proceed directly 
to Subsection 3.2. 


You will have met sin@ = a/h, cos? = b/h and tan@ = a/b as ratios in a 
right-angled triangle (see Figure 3.1). However, these definitions of the sine, 
cosine and tangent functions work only for 0 < @< 4. (Note that we shall 
almost always express angles in radians in this course.) 
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Here, the domain of g is 
x > 0, but since 

f(x) =1+ 22? is always 
greater than 0, there is no 
problem. 


The Greek letter @ is read as 
‘theta’. The Greek alphabet 
is given in the Handbook. 


Recall that 180° = 7 radians. 


To define the sine and cosine functions for a general value of 6, we can 
use Figure 3.2, which shows a circle of radius 1. Imagine that the line OA 
started along the z-axis, and was then rotated anticlockwise through an 
angle 6. Then the point A has coordinates (cos 6, sin @). Here 9 may be any 
value, positive or negative. (A negative value of 6 corresponds to a rotation 
clockwise. ) 


ay 


Figure 3.1 Figure 3.2 

If we rotate through 27 radians (360°), then we go round a full circle. So 
rotations of 6 and 6+ 27 leave A in exactly the same place. This leads to 
the repetitive nature of the graphs of sin and cos (see Figure 3.3). The word 
periodic is used to refer to the fact that these functions repeat their values 
every 27: that is, sin(@ + 27) = sin @ and cos(6 + 27) = cos 9, for any 6. 


Section 3 Trigonometric functions 


(a)y=sin q (b) y=cos q 


Figure 3.3 


Other trigonometric functions can be defined in terms of sin and cos. You 
will have met tan # = sin@/cos 6. This is defined for all real 6 except where 
cos? = 0 (ie. at 0= +5, ar and so on). You may also have met 


and cotd= 


1 
os 0” sin 0 tan@ = sind” 
We need to restrict the domains of cosec and cot to exclude points where 
sin 9 = 0, and the domains of sec and tan to exclude points where cos 6 = 0. 


1 cos 0 
sec 6 = cosec 9 = = 
c 


*Exercise 3.1 

(a) Find the values of sin@ and cos 6@ for = 0 and 6 = $. 
Hence find the values of tan@, sec@, cosec@ and cot @ for 6=0 and 
6 = 5, where they are defined. 


(c) Two right-angled triangles are shown in Figure 3.4. Use these to calcu- 
late the values of sin 6, cos 9, tan 0, cosec 0, sec @ and cot @ for 6 equal to 
each of ¢, 7 and 4. 


For what values of 0 is sin@ = 0? (Refer to Figure 3.3(a).) 


These functions may be 
referred to as tangent, 
secant, cosecant and 
cotangent. 


Figure 3.4 
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The function tan has the graph shown in Figure 3.5. Notice that tan ac- 
tually repeats its values every 7. (This is because sin(@ + 7) = —sin@ and 
cos(@ + 7) = —cos8, so that tan(@+ 7) = tan 0.) 


Ya 
y=tang . 
The graphs of sec, cosec and 
cot are given in the 
Handbook (as well as those of 
sin, cos and tan). 
| | 
3m _ t t 3n ia 
3 7 Pee 2 ” = 4 


Figure 3.5 


3.2 Inverse trigonometric functions 


Diagnostic test 3.2 

(a) Find all the solutions of cos @ = 0.5 in the range —27 to 47. 

(b) Find all the solutions of tan@ = 1//3. 

Now check your solutions against those given below. If you are happy with 


the answers, you may proceed directly to Subsection 3.3. 


Solution 


(a) In the range —7 to 7, we know that cos(+%) = 4 =0.5. Further solu- 
tions may be found by adding or subtracting multiples of 27 to these 
values: 


om tT mw St Ta 11a 
3? 39 3? 3? B? 3° 
(b) In the range 0 to 5, we know that tan § = 1/3. From the graph in 
Figure 3.5 we see that the required solutions are obtained by adding or 
subtracting any multiple of 7, so the solutions are 


gtnnr (neZ). 


Suppose that you need to solve for x the equation 


i 
COS f= om 


What solutions are there? You have seen (in Exercise 3.1(c)) that cos = 5, 
so one solution is certainly x = 4. There are others, however. For instance, 
since cos repeats its values every 27, another solution is x = 3 +27. We 


can find an infinite number of solutions by adding or subtracting multiples 


of 27 to/from %. There are even more solutions. If you look at the graph of 


“ 
cos in Figure 3.3(b), you can see that a horizontal line at y = 5 would cut it 
twice between 0 and 27: we also have cos _ — 5. And more solutions can 


be found by adding or subtracting multiples of 27 to/from an 
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In general, an equation of the form 


coor = y (3.1) 


is solved for a by finding a value of the inverse trigonometric function 
arccos: 


w = arccos y. 


However, we need to be careful here. Solutions of Equation (3.1) are not 
unique, as we saw for y = 5: If we reverse the roles of the axes in Fig- 
ure 3.3(b), we obtain the curve shown in Figure 3.6. However, this is not 
the graph of a function: a vertical line may meet the curve in many places, 
reflecting the fact that, given y, Equation (3.1) may have multiple solu- 
tions x. To ensure that, given y, arccosy has a unique value, we need to 
restrict the range in which values of arccos can fall. This is equivalent to 
specifying a codomain for the function arccos. The codomain of arccos is 
given in Table 3.1, together with codomains for two other inverse trigono- 
metric functions, arcsin and arctan. In Figure 3.6, when the values taken by 
arccos are restricted to this codomain, we obtain just the part of the curve 
shown in bold, which is the graph of a function. 


<v 


Figure 3.6 The graph of 
arccos is just the bold part of 


the curve 
Table 3.1 
Function Inverse Codomain of Domain of 
inverse function inverse function 
— id a bd Tv Tv 
y=sinx x=arcsiny =5 0 S95 -l<y<l 
y=cosx x=arccosy O<a<T -l<y<l 
— — Tv Tv 
y=tanx x=arctany Sgt a R 


Calculators and computer software can be expected to give values of the 
inverse trigonometric functions drawn from suitably restricted codomains, 
such as those in Table 3.1. However, in a particular model this may not give 
the appropriate value, and in such a situation it is important to be alert to 
the fact that an equation such as (3.1) actually has infinitely many solutions: 
there are usually two solutions in the range 0 to 27, together with infinitely 
many others obtained by shifting these two by multiples of 27. 


Exercise 3.2 
(a) Find all the solutions of sin @ = 0.8 in the range 0 to 67. 
(b) Find all the solutions of tan @ = 1. 


Section 3 Trigonometric functions 


The codomain of a function 
is a set within which its 
values must lie. 


The graphs of arcsin and 
arctan, as well as arccos, are 
given in the Handbook. 


Some texts use sin~', cos! 


and tan~! rather than arcsin, 
arccos and arctan. 


There is another possibility: 
if |y| > 1, then Equation (3.1) 
has no solutions. 
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3.3 Some useful trigonometric identities 


Diagnostic test 3.3 


Do Exercise 3.3 on page 27, and check your answers with the solutions given 
on page 55. If you are happy with your answers, you may proceed directly 
to Section 4. 


Figure 3.7 shows the relation between a clockwise rotation through 6 (re- 
garded as a rotation through —@) and an anticlockwise rotation through 0. 
Notice that such rotations lead to equal x-coordinates but to y-coordinates 


of opposite signs. So we have 
cos(—@) =cos@ and sin(—6) = —sin#@. 


These relations hold for all values of 0, and are examples of trigonometric 
identities. These can be useful in a variety of contexts, such as simplifying 
expressions involving trigonometric functions. 


To derive one particularly useful identity, apply Pythagoras’s Theorem to 
the right-angled triangle OAX in Figure 3.7. This leads to 


cos” 6 + sin? 6 = 1. (3.2) 
If we divide each side of (3.2) by cos? 6, we obtain the identity 

1+ tan? 6 = sec? 0. 
Similarly, if we divide each side of (3.2) by sin? 6, we obtain 

cot? 6+ 1 = cosec? 6. 


You may also have met previously the identities 


sin(@ + ¢) = sin @cos ¢ + cos Osin ¢, 
cos(@ + ¢) = cos@cos ¢ — sin @sin ¢. 


(3.3) 
(3.4) 


Replacing ¢ by —@ in these identities (and using cos(—¢) = cos¢@ and 
sin(—) = —sin @), we obtain 


sin(O — d) = sin @cos ¢ — cos Osin ¢, 
cos(@ — ¢) = cos@cos ¢@ 4+ sin @sin ¢. 


We can obtain an identity for tan(@+ ¢) by dividing those for sin(6 + @) 
and cos(6 + @): 


sin(@+¢) — sin@cos@+ cos@sin@ 


tan(@+ ¢) = 


cos(9+¢)  cos@cos¢—sin@sin@’ 
and dividing top and bottom by cos @cos @ gives 
tané+ tan @ 
t ta en a Me 
mee?) 1 — tan 6@tan @’ 


All these identities can be useful when manipulating expressions involving 
trigonometric functions. One situation where such manipulations are needed 
is when performing certain integrations by hand (as you will see in Section 6), 
and there expressions for sin26 and cos20 can be particularly useful. We 
ask you to obtain such expressions in the next exercise. 


(Note that it is usual to write sin20 rather than sin(20) — the spacing 
makes the meaning clear. However, when using the course computer algebra 
package you will have to include the brackets. When there is any danger of 
ambiguity, we shall also use brackets in the text.) 
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All the trigonometric 
identities discussed in this 
subsection are included in the 
course Handbook for easy 
reference. 


(cos q, sin q) 
A 


B 


ma) (cos(—q), sin(— q)) 


Figure 3.7 


Notice that we write (sin 0)? 
as sin? 0. 


Strictly speaking, this and the 
following identities hold only 
where the functions tan, sec, 
etc. are defined. 


These identities can be 
derived using transformation 
matrices, for example, but we 
shall not discuss here how 
this is done. 


Section 4 Complex numbers 


*Exercise 3.3 


(a) By putting ¢ = in the expressions for sin(@+ ¢) and cos(@+ ¢), 
establish the following identities. 


(i) sin 20 = 2sin 6 cos 6 (ii) cos 20 = cos? @ — sin? 6 


(b) Using trigonometric identities, and particular values of sin and cos, 
simplify each of the following. 


(i) sin(2z7 — 0) (ii) cos(27 — 0) (iii) sin(a — 6) 
(iv) cos(m — 6) (v) sin(5 — @) (vi) cos(5 — @) 


(vii) cos (37 + 2) 


The most useful identities to remember are (3.2), (3.3) and (3.4), as most 
of the others can be derived from these. 


4 Complex numbers 


Diagnostic test 4.1 


Do Exercises 4.1, 4.2, 4.3 and 4.7 below, and check your answers with the 
solutions starting on page 55. If you are happy with your answers, you may 
proceed directly to Section 5. 


Complex numbers provide a system within which we can solve any quadratic 
equation (and, indeed, any polynomial equation). They are helpful in some 
of the mathematical techniques introduced in this course, although the use 
we shall make of them is quite limited. 


There is no real number « satisfying the equation 
a? = 1 


However, there are circumstances where it is convenient to have a system 
of ‘numbers’ in which such an equation can be solved. Such a system is the 
system of complex numbers. A complex number is one of the form 
: . . Engineers commonly use j to 
z=a+bi (or, equivalently, z = a+ ib), represent +/—1. 
where 7 = —1, and aand bare real numbers. We refer to a as the real part 
of z, written Re(z), and to b as the imaginary part of z, written Im(z). Note that Im(z) is the real 
A complex number of the form a+ 0i is, in effect, just the real number a; number 6; Im(z) is not equal 
so the real numbers are seen as part of (a subset of) the complex numbers. *° bt. 


We denote the set of all complex numbers by C. Within C, we can solve any 
quadratic equation, since the formula will always give a solution once we 
can use \/—1. For example, the equation x? — 2x + 2 = 0 has the solutions 
2+ V22-4x2 2+ /-4 2+V4x V1 2+22 

cae 2 = 2 —% 

2 — 


=1+i, 


2. 


and the equation x —1 has the solutions x = +i. 
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An nth-order polynomial with real coefficients is a function of the form 
P(r) = nx” + dn1z”™ 1 +--+ +a12 + ao, 


where a, #0 and each coefficient a, (kK = 0,1,...,n) is a constant in R. 
Within the complex numbers, any such polynomial can be written as a 
product of a, and n factors of the form x — cz, (k = 1,2,...,n), with each 
cy in C. These n factors correspond to the n roots (i.e. solutions) 7 = cp 
(k = 1,2,...,n) of the corresponding polynomial equation p(x) = 0; if a fac- 
tor «—c occurs more than once, then the root « = c is a repeated root. 
In Subsection 2.3 we saw that for second-order (i.e. quadratic) polynomi- 
als, repeated roots correspond to perfect squares (i.e. factorizations such as 
a? —2cx +c? = (x —c)*). 


4.1 The arithmetic of complex numbers 


We can perform arithmetic (and algebra) with complex numbers, and this 
follows all the familiar rules for real numbers, such as 


u(vt+tw)=uv+uw and uxv=vxXu. 


To add, subtract or multiply complex numbers, just manipulate brackets in 
the usual way, and remember that i? = —1. For example, 
(2+ 31) + (4—7i) =24+4+4+3i-7i=6-4 
and 
(2+ 37) x (4—7i) = 2 x (4— Ti) + 37 x (4 — TZ) 
= 8— 14¢ + 121 — 217? 
= 8421-2 
= 29 — 21. 
Division of complex numbers is a little more complicated. The complex 
conjugate of a complex number z = a+ bi is Z = a — bi, and the rule for 
division is best expressed in terms of this. To simplify, for example, 
2+ 31 
4 — 73’ 
multiply top and bottom by 4+ 72, the complex conjugate of the denomi- 
nator, to obtain 
2+32  (2+3%) x (4+ 7i) 
A—Ti (4—Ti) x (44+ 7i) 
— 841414 121-21 


~ 16+ 28% — 281 + 49 
13 + 267 


This process always reduces the denominator to a real number, since 
(a + bi) x (a — bi) = a? +B? is always real. Thus, in general, 

c+di  (e+di) x (a— bi) 

at+bi a? + b2 : 
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An nth-order polynomial is 
sometimes referred to as a 
polynomial of degree n. 


In fact, this result also holds 
if the coefficients a, are 
complex. 


Repeated roots are sometimes 
referred to as equal roots or 
coincident roots. 


Note that a? + b? is always 
positive, unless a = b = 0. 


The modulus of a complex number z = a + bi is Va? + 6, written |z|, so 
the rule for division can be written, for complex numbers u and v, as 


U Uv 


vu 


*Exercise 4.1 

Let v = 3 — 47 and w = 2 —7. Evaluate each of the following. 
(a) U (b) || (c) v-w (d) vw 

(e) w/v (f) 1/w (g) w? (h) 2w — 3v 


*Exercise 4.2 
Solve (for x in C) the quadratic equation 2x? +22 +1=0. 


If a quadratic equation with real coefficients has complex roots (as in Exer- 
cise 4.2), then these always form a pair of complex conjugates (of the form 
ax bi). 


4.2 Polar form 


Polar coordinates 


Polar coordinates provide an alternative way of representing points in 
the plane. Figure 4.1 shows a point A with Cartesian coordinates (x, y) and 
polar coordinates (r, 6). The quantity r is the distance from A to the origin, 
so r > 0. The angle 0 is measured anticlockwise from the x-axis. (Negative 
angles correspond to measuring clockwise from the z-axis.) It is convenient 
to allow @ to take any real value, but this has the consequence that the polar 
representation of a point is not unique. For example, (r,@) and (r,@+ 27) 
provide polar coordinates of the same point. We can see from Figure 4.1 
that if a point has polar coordinates (r,@) and Cartesian coordinates (x, y), 
then 


x=rcos@é and y=rsiné. 


These equations allow us to translate from polar to Cartesian coordinates. 
To translate from Cartesian to polar coordinates, we can use (see Figure 4.1) 


r=~\/-+y", cos@=a2/r, sme@=y/r (0). (4.1) 


Equations (4.1) do not have a unique solution for @ in R, but they do have 
a unique solution in the range —7 < 6 < 7. 


*Exercise 4.3 


What are the polar coordinates (r,@) of each of the following points, for 0 
in the range —7 << 0< 7? 


(a) (2,0) (b) (1) (&) (-1,-1) 
(d) (4,0) — (e) (0,4) (f) (-V3,1) 


Section 4 Complex numbers 


This follows from the 
formula (2.8) for the solution 
of a quadratic equation. 


We shall use angle brackets to 
distinguish polar from 
Cartesian coordinates. (This 
is not a universal convention.) 


ya 
y=rsing¢y ------ A (x, y) 
| 
r | 
| 
| 
| 
el i a 
x=rcosq x 
Figure 4.1 


If r = 0, then we can choose 
any value for 6. 
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The polar form of a complex number 


A complex number x + yi can be represented geometrically on an Argand 
diagram as the point with Cartesian coordinates (x,y). For example, Fig- 
ure 4.2 shows on an Argand diagram the point 3 + 27, with real part 3 and 
imaginary part 2. 


Combining polar coordinates and the Argand diagram leads to the polar 
form of a complex number. For z = x + yi, find the polar coordinates (r, 6) 
of the point with Cartesian coordinates (x,y). Then, using the relation 
between polar and Cartesian coordinates, we have 


z=x2+yi=a+iy=rcosé+irsind = r(cosé+isin8). 
This is the polar form of z. Here, r = \/x?+y? = |z| is the modulus 
of z. We call 0 an argument of z. As noted above, 0 is not unique, but 
there is a unique value of @ in the range —7 <@< 7. This is called the 
principal value of the argument, and we write it as Arg(z). When there is 
no possibility of confusion, we often write (r,@) as shorthand for the polar 
form r(cos 6 + isin @). 


Exercise 4.4 


oh 


If a complex number z has polar form (2,—7), what is its Cartesian form? 


Exercise 4.5 


Express each of the following complex numbers in polar form, choosing the 
principal value of the argument. 


(yo (brie 4) laa 
(d)4 (e) 4¢  (f) -V3+i 


Multiplication of complex numbers in polar form 


Multiplication of complex numbers is simpler in polar than in Cartesian 
form. We have 


(71,01) X (72,02) = (rire, 01 + 2). (4.2) 


That is, to multiply two numbers in polar form, we just multiply their moduli 
and add their arguments. The equation above can be justified as follows: 


r1(cos 6; + isin 61) x r2(cos 62 + isin 2) 
= 1112(cos 91 cos 62 — sin 0; sin 62 + i(sin 01 cos 62 + cos 0; sin 82)) 
= r1ro(cos(6; + 02) + 7sin(A + 62)), 
using trigonometric identities (3.4) and (3.3) from Section 3. 


From (4.2), we can deduce a formula for division of complex numbers in 
polar form: 


(r1, 1) > (rg, 02) = (ri/r2, 91 = 62). 


Also, if we multiply the complex number (r, 0) by itself repeatedly, we obtain 
a formula for an integer power of a complex number: 


Oo) SO" ne). 
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2 1% 1 2 3 4 


-i7J 


-2i- 


Figure 4.2 


Note that although 0; + 2 is 
an argument of the product, 
it may not be the principal 
value of the argument. 


The proof of this is not 
difficult, but we omit it for 
reasons of space. 


With r = 1, and written as 
(cos @ + isin #)” = cosné + isin nd, 


this result is known as De Moivre’s Theorem. 


Exercise 4.6 
Find (1 —i)”°. 


Complex exponentials 


For a complex number z = «+ zy, we can define the complex exponential 
e* by the formula 


e* = e*(cosy + isiny). 
We choose this definition because it works! That is, this complex exponential 
behaves, as we would hope, like the real exponential function. In particular, 
it retains the property that, for any complex numbers u and v, 


et x e® = ett, 


In the case when x = 0, the definition of the complex exponential gives 
e'” = cosy + isiny. 
This leads us to a third way of expressing a complex number, which is often 


convenient. If z has polar form (r,@), then we have 


z=r(cos@+ isin 6) = re”, 


where re’? is referred to as the exponential form of the complex number z. 


In this form, r is the modulus of z and @ is the argument of z. As with the 
polar form, the value of @ is not unique, but there is a unique choice of @ in 
the range —7 <0< 7. 


*Exercise 4.7 


Let z = (r,0). Use the exponential form of z to find Re(ze™*). 


5 Differentiation 


The concepts and techniques of calculus are central to many of the mathe- 
matical methods discussed in this course. In this section, we consider dif- 
ferentiation. 


5.1 Rates of change 


Diagnostic test 5.1 


Do Exercise 5.3 on page 34, and check your answers with the solutions given 
on page 56. If you are happy with your answers, you may proceed directly 
to Subsection 5.2. 


Section 5 Differentiation 


Abraham de Moivre 
(1667-1754) was born in 
France but spent his adult life 
living and working in 
England. 


This equation is known as 
Euler’s formula. 

Leonhard Euler (1707-1783) 
was a prolific mathematician, 
making many fundamental 
contributions to diverse areas 
of mathematics and science. 
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Differentiation gives the rate of change of one variable with respect to 
another. As in Section 2 (see page 10), suppose that a reservoir contains 
V cubic metres of water at time ¢ minutes after midday on 1 June, where 


V =2~x 10° — 15t. 


For a linear function such as this, the rate at which V is changing as t 
changes is the same at all times t. (The volume of water is falling by the same 
amount each minute.) This corresponds to the fact that a linear function 
has a straight-line graph, whose gradient (or slope) is the same everywhere. 


Now consider a non-linear function, such as 
s = 4(-9.81)t? + 10¢ + 2, (5.1) 


which was used on page 14 to model the height (s metres at time t seconds) 
of a ball thrown vertically upwards. The rate of change of height with time, 


d . . : ee 
written 7a is equal to the velocity v of the ball, and this varies with time. 


This function has a graph that is a parabola, and the gradient (or slope) of 
a parabola varies from point to point. Using rules discussed below, we can 
differentiate (5.1) to obtain 

ds 


= — = —9.81t+ 10. 
v di + 


Thus differentiation of the function s = f(t) produces another function, 
v = f’(t), called the derivative or derived function of f. At each value 
of t, f’(t) = —9.81t + 10 gives the gradient of the graph of (5.1). 


The gradient of a general graph y = f(x) at a particular point x = 29 gives 
the derivative f’ of the function f at that point. The gradient of f at x9 may 
be defined as the limiting value of the gradient of the chord AB in Figure 5.1, 
as B approaches A. The gradient of this chord is (f(ao +h) — f(xo))/h. The 
process ‘B approaches A’ corresponds to h tending to 0, which we write as 
h — 0. The definition applies only to suitable functions (called ‘smooth’), 
where this limit exists and is the same whether h approaches 0 through 
positive or negative values. Thus we may formally define the derivative of 
f at x by 


; +h) — f (20) 
f' (xo) tim ( h 
Working from this definition, we can obtain the derivatives of the various 
standard functions discussed in Sections 2 and 3. (The details of this need 
not concern us here.) These derivatives are tabulated in the Handbook, and 
we shall use them as required. 


Derivatives can also be found using the computer algebra package for the 
course. However, it is not always convenient to use the computer (and indeed 
it will not be available to you in an examination), so it is useful to be able 
to perform differentiation by hand, at least in relatively simple cases. To do 
this, we combine two elements: 

e derivatives of standard functions; 

e rules for differentiating combinations of functions of various types, in 

sums, products, quotients and compositions. 


The simplest rules concern constant multiples and sums. In general, the 
derivative of a combination af(x) + bg(x), where a and 6 are constants, is 
af'(x) + bg'(x). This rule enables us to differentiate (5.1) to obtain 
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YA 


| 
| h | 
| 


[fl +h) — flxo) 


Figure 5.1 


The more complicated rules 
are discussed in 
Subsection 5.2. 


ay 


ds, ae d d 
= : — —(2 
= $(—9.81)(2t) + 10(1) +0 
= —9.81¢+ 10 


(using the derivative of the standard function t” for n = 2 and n = 1, and the 
fact that the derivative of a constant is 0). Similarly, if V = 2 x 10° — 15t, 
we find its derived function to be V’ = —15. This is constant (not dependent 
on t), corresponding to the fact that this function has a straight-line graph 
(with constant gradient). 


There are various notations for derivatives, some of which we have used 
above. We shall use whichever is convenient in a particular context. No- 
tation expressed purely in terms of variables, such as ds/dt, is referred to 
as Leibniz notation (after its inventor, G. W. Leibniz (1646-1717)). This 
notation is extended to write, for example, 


d 
Het +65sin2t) or (ax + bax”). 


Leibniz notation is sometimes a little clumsy, and may be inconvenient in 
situations where the role of functions is prominent. There the more modern 
function notation of adding a prime (’) to the function name is preferred. We 
can then write, for example, f’(3) to mean the value of the derived function 
of f at 3. Sometimes we find it convenient to mix function and variable 
names, and write, for example, s’, rather than introducing a separate name 
for the function relating the variables s and t. When using Leibniz notation, 


ds ae . 
we may sometimes write a) to emphasize that this derivative is a function 


of t, or (4) to mean the value of the derivative when t = 4. Some of the 


simpler forms of notation are open to ambiguity if used in inappropriate 
contexts, and at times we need to be careful about how we express things, 
for example by using the function notation in a precise manner. 


Differentiation of a derivative produces the so-called ‘higher’ derivatives. 
If, for example, we have f(x) = x? + 52, then differentiation gives f’(x) = 
3a? +5. This derivative is itself a function of x, and can be differentiated 
again. This gives the second derivative as f”(x). (In the above example, 


d (d 
f"(«) = 6z.) In Leibniz notation, we write the second derivative, da (<*) ; 
2 x \ dx 


as a Differentiating yet again leads to the third derivative, written 
x 


d3 
ot or f(x), which may also be written f@) (x). (f!"(x) = 6 in this case.) 
The process can be continued, and a general nth derivative may be written 

a er 
as in or f(")(x), where n is referred to as the order of the derivative. 

2 

There is one final piece of notation to mention. We so often find that time 
(habitually denoted by t) is the independent variable that there is a separate 
notational convention for differentiation with respect to time. We use a dot 
over the variable to indicate a first derivative with respect to t, and two dots 
to indicate a second derivative. So if x(t) is the position of an object as a 
function of time t, then «(t) means the same as x’(t), and is the velocity of 
the object, while Z(t) means the same as x(t), and is its acceleration. 


Section 5 Differentiation 


ds 
In text, — may be written as 


ds/dt, to save space. 


We may sometimes write s’(t) 
if we want to emphasize that 
s’ is a function of t. 


The derivative dy/dx is 
sometimes referred to as the 
first derivative. 


This notation is attributed to 
Isaac Newton (1642-1727), 
and so is sometimes referred 
to as Newtonian notation. 
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The next four exercises offer practice in differentiating standard functions, 
and constant multiples and sums of these. The Handbook contains a num- 
ber of standard derivatives, and you can refer to that in answering these 
exercises. You will, however, find it helpful later in the course if you are 
able to differentiate polynomials, exponentials and trigonometric functions 
without reference to the Handbook. 


Exercise 5.1 


Suppose that an object is moving in a straight line so that its position x 
(measured from a chosen origin) is related to time t by the equation 


x = 5+ 7cos(3t 4+ 2). 


(a) Find expressions in terms of t for the velocity «(t) and acceleration i(t) 
of the object. 


(b) Use the above equation to eliminate t from your expression for #(t), and 
hence find a relationship between the position and acceleration of the 
object that holds at all times. 

Exercise 5.2 


The weekly wage bill of a company, t years in the future, is projected to be 
£B, where 


B = 10° exp(0.04t). 
Find an expression for the rate at which the wage bill will be rising in t years’ 
time. What will this rate of rise be as a percentage of the wage bill at the 
time? 
*Exercise 5.3 


Calculate the following derivatives. 
d 
(a) I where y = 1 — 0.9exp(—0.5z). 
¢ 
(b) F’(2), where F(x) = 324 — 4x + 1. 


(c) ay where y = Int (t > 0). 

(d) F’(%), where F(x) = 3sec(2x) — 4cos(—3z). 

(e) g”(0), where g(t) = acos(3t + ¢) + bsin(3t + ¢) (and a, b and ¢ are 
constants). 

Exercise 5.4 

Calculate the following derivatives. 


(a) vu’(z), where v = 3tanz + 2cos z. 


(b) “ at t = 75, where y = Asin 3t + Bcos 3¢t (and A and B are constants). 
(c) £4), where f(t) = 2sin 3t. 
(d) f"(y), where f(y) = arctan(3y). 
d 
(e) i when x = 0, where z = In(cx + d) (and c and d are constants, with 
x 
ad >). 
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5.2 Differentiating combinations of functions 


Diagnostic test 5.2 


Do the starred parts of Exercises 5.5, 5.6 and 5.8 below, and check your 
answers with the solutions starting on page 56. If you are happy with your 
answers, you may proceed directly to Subsection 5.3. 


As we have shown, derivatives of constant multiples and sums are calculated 
in a natural way. For derivatives of other combinations — products, quo- 
tients and composites — the rules are less obvious. These rules are given 
below and in the Handbook; you will find it advantageous later in the course 
if you are familiar with these rules and can use them without reference to 
the Handbook. 


Product Rule (fg)'=/f'g+fa’. 


Or, in Leibniz notation, 


e (wo) = a, + le 
dx dx dx 
! Wee / 
Quotient Rule (£) = Ig Fg : 
g g 
Or, in Leibniz notation, 
du du 
d e _ i ae 
dz \v v2 
Example 5.1 


Find h'(x), where h(x) = x? cos 2z. 


Solution 
The function h(x) is a product, f(x)g(x), with f(x) = x, g(x) = cos2z. 
We have 
f'(z) =32? and g/(x) = —2sin2z. 
So, using the Product Rule, we have 


h' (x) = 32% cos2x —22°sin2r. Hl 


Exercise 5.5 
In x 
on 
*(b) Find f’(t), where f(t) = t? n(3t + 4). 
(c) Find g’(0) (in terms of the constants A, B and C), where 
g(t) = (At+ B)sin(At+C). 
*(d) If the position of an object at time t is given by e *' sin 4t, find its 
velocity and acceleration. 


*(a) Find ae where y = 


Section 5 Differentiation 


A useful way of remembering 
this rule is: derivative of first 
times second, plus first times 
derivative of second. 


A useful way of remembering 
this rule is: derivative of top 
times bottom, minus top 
times derivative of bottom, all 
over bottom squared. 
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The rule for composite functions is a little more complicated to use. 
Composite Rule If h(x) = g(f(x)), then h’(x) = g/(f(x)) f' (2). 


Expressed in Leibniz notation, this rule looks rather different: if y is a 
function of u, and u is a function of x (so y = g(u) and u= f(x)), then 


d Ad In this form, the Composite 
Mt bal ea Rule is referred to as the 
dx dudzx Chain Rule. 


In Section 2 (page 20) we mentioned an example of a composite function. 
There we had velocity v related to time t by the function 


v = g(t) = 10- ot, 
and time related to height s by the function 
10 — 100 + 4g — 29s 
£=fils)= , : 


If we wish to calculate the rate of change of v with respect to s, it is natural 
to use the Composite Rule in its Leibniz form: 


du _ dvdt 

ds dt ds’ 
Now 

du _ 

oS 
and 


dt  d (10—/100+4g-29s\ _ 1 
ds ds g ~ 4/100 + 4g — 298° 


So, using the Composite Rule, 


du _ 1 _ g 
ds ~\ fl00t4g—29s/ Vl00+4g—298 


(You can check that this is correct by differentiating Equation (2.13) di- 


rectly.) 

Example 5.2 

Find f’(x), where f(x) = sin? z. 

Solution 

If we let u =sinz, then we have f(x) = u°. The recognition of sin® x as a 

composite function, and of 

We then have how to break it down into two 

df du 2 9 parts, each consisting of a 
f'(z) = ———— = 3u° cosz = 3sin* x cos x, standard function, is the key 

du dx 


‘ ; : to differentiating it. 
replacing the variable u by sinz. 


Your proficiency with differentiation will depend on your experience and will 
develop with practice. It will be helpful for your study of MST209 if you 
are able to differentiate expressions such as that in Example 5.2 without 
recourse to the computer algebra package for the course — and, ideally, 
without even needing to refer to the Handbook. But such proficiency may 
take some time to develop, and while it is developing feel free to check your 
work using the computer. 
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Exercise 5.6 
Use the Composite Rule to differentiate each of the following. 


*(a) y = exp(t?) (b) f(x) = (3a? + 4)® 
*(c) z= tan(3v + 4) (d) g(z) = V4— 2? 
(0) f(2)=——— 


(vita) 


Exercise 5.7 


Differentiate the following functions. 


(a) y=sec ( ) (b) z=? exp(# +1) 


x 
x2 +1 


Suppose that we want to find the gradient at the point (2,1) of the tangent 
to the ellipse with equation 


a” + dy? = 8. (5.2) 


We want dy/dx at x = 2. We could start by expressing y as a function 
of x, but a more convenient approach is to differentiate the equation as it 
stands. To differentiate y? with respect to x, we use the Composite Rule, 
d(y*) dy _, dy , = 
= 2y—. So, differentiating both sides of Equation (5.2) 
dy dx dx 
with respect to 7, we obtain 


dy _ 
dx 
When «x = 2 and y = 1, this gives 4+ 8dy/dz = 0, so dy/dzx = —5. There- 


fore the gradient of the tangent to this ellipse at (2,1) is —5. 


and obtain 


2x + 4(2y) 0. 


Differentiation with respect to x of an expression such as x? + 4y?, where y 
is a function of x, is known as implicit differentiation. 


*Exercise 5.8 


(a) Use the Product and Composite Rules to find the following in terms of 
x, y and dy/dz. 


d 
() Say) a) Se") 
(b) Find the gradient at the point (—1,1) of the tangent to the curve 
a? + a7y + y? =, 


Just occasionally, we need to consider differentiation of a complex-valued 
function, of the form 


f(t) = g(t) + th), 
where g and / are real functions. Differentiation of such a function is defined 
in a natural way, as 


f(t) = g(t) + th'(t). 
So, for example, if f(t) = cos 3t + isin 3t, then f’(t) = —3sin 3t + 3% cos 3t. 


Section 5 Differentiation 


These differentiations involve 


more than one rule. 
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Unit 1 Getting started 


Exercise 5.9 
Find the second derivative of the function f(t) = cos 2t + isin 2t. 


5.3 Investigating functions 


Diagnostic test 5.3 


Do Exercise 5.10 on page 39, and check your answers with the solutions 
given on page 57. If you are happy with your answers, you may proceed 
directly to Section 6. 


Faced with an expression made up of some combination of standard func- 
tions, how might you investigate its behaviour? As an example, consider 
the function 

v 


faq + 1.5u + 0.00802’ 


where we would like to see how f varies as v varies. 


A sketch graph of f against v helps with this, and a computer algebra 
package or graphics calculator will provide such a graph. However, it is not 
always obvious for what range of values to plot the graph, so it is helpful 
to be able to deduce some information about the general behaviour of a 
function ‘by hand’, without recourse to a machine. Such information can 
also be used to cross-check results obtained from a machine, and to flesh 
out the picture more fully. This example will be continued in Exercise 5.11, 
but first we shall make some general remarks about sketching graphs. 


Questions that you might consider when sketching a graph include the fol- 
lowing. 


e Are there any points where the function is not defined? 
(This often happens if the expression is a quotient in which the denom- 
inator can be 0.) 


e Where does the function cross the axes? 

(To find the points where the function crosses the horizontal axis, solve 
the equation f(x) =0 for x. The function crosses the vertical axis at 
the point y = f(0).) 

e How does the function behave for large and small values of the indepen- 
dent variable (or at the endpoints of the domain if this is an interval)? 
(For a function with domain R, examine the values of the function at 
large positive and negative values. For a function defined on an interval, 
simply evaluate the function at the endpoints.) 


e On which parts of its domain is the function increasing, and on which is 
it decreasing? 
(You can look at the sign of the gradient at various points.) 
e Are there any local maximum or minimum values? 
The last question can be answered using differentiation. A stationary 
point of a function f(x) is a value of x where f’(x) = 0. Local maxima and 


local minima occur at stationary points, although a stationary point need 
not necessarily be either. Figure 5.2 illustrates such stationary points. 
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A function f(z) is 
increasing on an interval if 
f(x) increases in value as x 
increases (or equivalently if 
f'(x) > 0). Similarly, f(a) is 
decreasing on an interval if 
f(a) decreases in value as x 
increases (or equivalently if 


f'(x) < 0). 


gradient is 0, 
but neither 
maximum nor 
minimum 

local 
minimum 


ay 


B minimum 


Figure 5.2 


In models, we often wish to find the overall maximum or overall minimum 
of some function, usually referred to as the global maximum or global min- 
imum, respectively. The global minimum or maximum may well occur at 
a stationary point; but caution is needed, for they need not necessarily do 
so. For example, the global minimum of the function with domain [0, oo) 
illustrated in Figure 5.2 occurs at an endpoint of its domain (a = 0), which 
for that function is not a stationary point. In fact, a function need not 
have a global maximum or global minimum. For example, the function f(x) 
in Figure 5.2 exceeds the local maximum value when z is large, but never 
reaches a global maximum. However, the value of f(a) in Figure 5.2 does 
not become arbitrarily large either: it is bounded above by (but never actu- 
ally attains) the value A (but it comes arbitrarily close as x becomes large). 
We refer to the line y = A as an asymptote of the graph of f. 


Having found a stationary point of a function, we can determine whether it 
is a local maximum or a local minimum either by looking at the sign of the 
second derivative at the stationary point, or by looking at the sign of the 
first derivative to either side of the point. (For example, if b is a stationary 
point, and f’(z) is positive for x less than b and negative for x greater than b, 
or if f(b) is negative, then b is a local maximum. These tests are given in 
detail in the Handbook.) 
*Exercise 5.10 

Find any stationary points of the function 

y(t) =5—2(x+1)e2* (x >0). 


Classify these as local mimima or local maxima or neither, and evaluate 
y(x) at these points. 


Example 5.3 
Suppose that 
(a? — 3)y =a —2. 
Sketch a graph of y against 2. 


Solution 


g—2 
We have y = a3’ but need to note that this expression for y is not 
A be — 


defined if 2? — 3 = 0, ie. if e = +3. We can see that y = 0 if (and only if) 
x = 2, so the graph crosses the x-axis at this one point. If x is large (positive 
or negative), then y will be close to zero. 


Section 5 Differentiation 


A function f(x) is bounded 
above by a number A if 
f(a) < A for all x in the 
domain of f. Similarly, f(a) 
is bounded below by B if 
f(x) > B for all x in the 
domain of f. The numbers A 
and B are referred to as an 
upper bound and a lower 
bound for f, respectively. 
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Unit 1 Getting started 


To look for stationary points, we use the Quotient Rule to calculate 


dy — (1)(a* —3) —(@—2)(22) | —a? +42 -3 
dx (x? — 3)? (a? — 8)2 
This is zero if x? — 4x +3 = 0; ie. if ¢ =1 or 3. The second derivative 
is a bit complicated to calculate, so it is easier here to look at the sign of 
the first derivative near « = 1 and x = 3 to check whether these stationary 
points are local maxima or minima. If x is just less than 1, then dy/dx Try x =0.9 and #=1.1. 
is negative, while if x is just greater than 1, dy/dz is positive, so x = 1 is 
a local minimum. For x just below 3, dy/dz is positive, while for x just Try x= 2.9 and # = 3.1. 
above 3, it is negative, so x = 3 is a local maximum. Note that at x = 1, 
y=5; while at « = 3, y = @-. 


We can incorporate all this information (plus other information, such as 
the behaviour of y near z = +\/3 and the value of y at certain values of 2, 
e.g. x = 0) in constructing a sketch graph, as in Figure 5.3. 


YA 


Again we refer to the lines 
a = V3 and « = —V3 as 
asymptotes of this graph. 


Figure5.3 


A continuous function is one whose graph can be drawn without lifting your 


pen from the paper. The function f(«) = (x — 2)/(x? — 3) is not continuous 


on any domain containing either 3 or —V3. 


A smooth function is continuous and has a continuous derivative. For 
example, the function 


32, x > 0, 
fa) = {5 x <0, 
is continuous, but is not smooth (since its derivative is not continuous at 
=). 


Exercise 5.11 
(a) Sketch a graph of the function 
U 
P() = T7750 10.0082 


In particular, find: any values of v for which f(v) is zero; any values 
of v for which f(v) is not defined; and any local maxima or minima of 
f(v). Also, indicate how f(v) behaves as v becomes large. 


(v > 0). 


(b) Find the global maximum and minimum of f. 


40 


6 Integration 


Subsection 6.1 provides a reminder of the basic idea of integration as ‘revers- 
ing differentiation’. Subsection 6.2 discusses how we may calculate relatively 
simple integrals by hand. In Subsection 6.3, we look at two techniques for 
finding more complicated integrals by hand. 


As well as ‘reversing differentiation’, integrals also arise as the limits of 
certain sums. In Subsection 6.4, we see how this can lead to integrals arising 
in models. 


6.1 Reversing differentiation 


Diagnostic test 6.1 


Do Exercises 6.1 and 6.2 below, and check your answers with the solutions 
starting on page 57. If you are happy with your answers, you may proceed 
directly to Subsection 6.2. 


Throughout this course you will meet a variety of differential equations. 
These are equations involving the derivative of a function, for example 

ds 

Fla 5t + 7. (6.1) 
The objective is usually to ‘solve’ the equation by finding an expression for 
the function itself (rather than its derivative). To do this involves ‘reversing’ 
the differentiation, and this process is referred to as integration. In the 
above example we integrate both sides of the equation with respect to t, 
obtaining 


s= [(or+ rat 
To evaluate this integral, you can use the table of standard integrals in the 
Handbook. These show that | tdt = 5t and / 1 dt = t, and on integration 
we obtain 

s=30U4+7t+¢, (6.2) 


where c may be any constant. To confirm this, note that with s given 
by (6.2), we have 

ds d js 9 

—=— (2t°+7t = 5t+ 7, 

oa (3¢° + 7é +c) + 
as required by (6.1). Since c may be any constant, we see that the differential 
equation (6.1) does not have a unique solution. 


Generalizing, suppose that f is a known function, and 
F'(a) = f(a). 

We write the general solution of this differential equation as 
P(0)= [ f(x) ax, 


where the right-hand side, { f(x) dz, is called the indefinite integral of 
f(a), and the function to be integrated, f(a), is called the integrand. 


Section 6 Integration 


The constant c is often 
referred to as an arbitrary 
constant or a constant of 
integration. 
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Unit 1 Getting started 


Given a differential equation such as 
1 
F'(x) = —— , 
finding a function F’ whose derivative is the function on the right-hand side 
is a non-trivial task. For (6.3), we are lucky, since the table of standard 


(6.3) 


1 
derivatives in the Handbook gives —(arctan 2) = . Hence we have 
dx 1+ 2? 
1 Any function that 
/ 5 dz = arctanz + ¢, differentiates to 1/(1 + 27), 

1+2 such as F(a) = arctane +5, 

where c is an arbitrary constant. is referred to as an integral 
(or antiderivative) of 

In contrast, consider finding the integral 1/(14 2?). 


/ exp(—2”) dz. 


At first sight, this might seem no harder a problem to solve than (6.3). 
In fact, however, it is impossible! To be more precise, there is no simple 
combination of the standard functions (polynomials, sin, cos, exp and In) 
that when differentiated gives exp(—2?). 


Finding explicit expressions for integrals is a much harder task than finding 
derivatives. The rules of differentiation ensure that we can, in principle, 
find an explicit expression for the derivative of any combination of standard 
functions. The equivalent is not true for integrals. What is more, even 
where integrals can be found, this may be a messy process. The methods 
whereby particular functions can be integrated depend on recognizing what 
happens to work in various particular cases. 


Integrals are readily found if they appear in the table of standard integrals 
in the Handbook (as do f[ e** dx and f a' da, for example). So are the 
integrals of constant multiples and sums of constant multiples of these, such 
as [(4e?* + 92x") dx. There are also integration techniques that enable you 
to find some more complicated integrals. Integration by substitution is based 
on the rule for differentiating composite functions, while integration by parts 
is based on the rule for differentiating a product. There are brief reminders 
of these two techniques in Subsection 6.3. 


The table of standard integrals in the Handbook contains quite a wide se- 
lection of integrals. Some of these integrals are deduced from the table of 
standard derivatives, others from using integration by parts or substitution. 
You can regard them all as the fruit of others’ experience, and draw on 
them as needed. The correctness of an integral, obtained either by using 
the Handbook or from the computer algebra package for the course, can be 
verified by differentiation. 


*Exercise 6.1 


Use differentiation to verify that the following integrals are correct (where 
a #0 is a constant and c is an arbitrary constant). 


; x La 
(a) / wsinardx = ——cosaz + —5sinar +c 
a a 


1 Tw TT 
(b) [vnarde = — 7 In(cos ax) +e (-%<ar<) 
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Section 6 Integration 


*Exercise 6.2 


(a) Use implicit differentiation (and a trigonometric formula) to show that 
if « = tany, then 


dys 1 


dx 1+a?° 


dx = arctanz +c. 


1 
Hence confirm that / 
1+2? 


(b) Use implicit differentiation of « = siny (and a trigonometric formula) Here -1 <2 <1, 
to deduce an expression for the indefinite integral =F <U< 39- 


1 
———. dz. 
a= 


Check that your result agrees with the one in the Handbook. 


6.2 Evaluating integrals 


Diagnostic test 6.2 


Do Exercises 6.4 and 6.6 below, and check your answers with the solutions 
starting on page 58. If you are happy with your answers, you may proceed 
directly to Subsection 6.3. 


Your first recourse for finding an integral by hand is the table of standard Or, preferably, use your 
integrals in the Handbook. If the integrals of functions f and g are known, memory! 

then the integral of af + bg, where a and 6 are constants, is readily found, 

using the rule 


[ere + bg(2)) de =a [ f(x) dx +b [ g(x) ae 


So, for example (referring to the Handbook for [ e? dx and f a’ dex), 


[de +oat)de=4 fPar+9 fat as 


— 4(ze**) + 9( 52°) +e 

=e" 4 22° ie: 
Sometimes algebraic manipulation can transform an expression to be inte- 
grated into a more amenable form. For example, the manipulation 


3a? + Qe 32? 


_ 4 2x 
fe Ya Va 
transforms the expression on the left into a sum of constant multiples of func- 
tions in the Handbook table. Less obvious transformations can be achieved 
using trigonometric formulae. For example, using cos 2a” = cos? a 


_ 3773/2 4 Qe1/2 


x —sin* x 
(see Exercise 3.3(a)(ii)) and sin? 2 + cos? x = 1 (Identity (3.2)), we obtain 
cos 2x = 2cos*? x — 1. Rearranging this gives 

cos” z = $(1+cos 22), 


which enables us to integrate cos? x. 
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Exercise 6.3 
Use the identity 


cos” az = (1 + cos 2az) 


(where a # 0 is a constant) to obtain [ cos? ax dx. 


At times, attention needs to be paid to domains, to avoid giving, as in- 
tegrals, expressions that are not defined. For example, In x is defined (as 


1 
a real number) only for x > 0, so J: dx = |lnx+c holds only for x > 0. 
x 


1 
To integrate 1/x for x < 0, we use J: dx = In(—x) +c. The Handbook 
x 
gives domain restrictions where necessary. We sometimes write, for con- 
1 
venience, [oe =In|z|+c (4 £0), but do be aware that this covers 
x 


the two separate cases of x <0 and x > 0. Similarly, we sometimes write 


1 
dx = —Injaz + b|+ c (an +640). 
ax +b a 


*Exercise 6.4 


Find the following integrals. 


(a) [eae (b) [ ssec2(at) dt (c) /uomp” 
(d) /rye (<3) €) eer (y > 3) 


Exercise 6.5 


Find the following integrals. 


(a) [ec cos(—2t) + 8sin 4t) dt (b) one dt (-3<t<3) 


(c) / eet (t<0) (d) / (2140 -7) dt (t>0) 


The next example again uses a standard integral from the Handbook, but 


requires careful matching of parameters and attention to domains. 


Example 6.1 


1 1 
Por A>0,2>,find f ae. 


Solution 


We can match the integrand with that of a standard integral by writing it 


in the form 
1 —1 
z(1—Az) A(x—O)(x—- 4) 
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For x < 0, 
d 1 
5 (in(—2)) = — x (-1) 
v —2x 
wc 
mee 


using the Composite Rule. 


Use the standard integrals 
given in the Handbook as 
necessary. 


Use the standard integrals 
given in the Handbook as 
necessary. 


1 
The Handbook gives the integral / ——_——__ dx for b < a. To match 


(@—a)(e—5) 
that, choose b = 0 and a = 7 Since x > 7 we must choose the standard 
integral for the case a < x. So we obtain 


laa" a) Geen” 


*Exercise 6.6 
1 
(a) For k > 0, —k<u<hk, Pane 


(Hint: Remember that v2 — k? = (u+ i ) 


(b) Fora >0, b>0, - ous vant fg 
22 


6.3 Integration by parts and by substitution 


Diagnostic test 6.3 


Do Exercises 6.7 and 6.8 below, and check your answers with the solutions 
starting on page 58. If you are happy with your answers, you may proceed 
directly to Subsection 6.4. 


In this subsection, we look briefly at two methods for evaluating more com- 
plicated integrals. You will use these methods later in the course. For 
now, it is particularly useful to recognize the type of integral illustrated in 
Equation (6.5). 


Integration by substitution 
The formula for integration by substitution is 


/ COO te / f(u) du. (6.4) 


Or, in Leibniz notation, 


[tw Far = f fu) du 


We say that we have ‘substituted’ u = g(x). The following example shows 
one way of executing integration by substitution. 


Section 6 Integration 


45 


Unit 1 Getting started 


Example 6.2 
Find [esine2 + 327) dex. 


Solution 
Let u = 2+ 327, so du/dx = 62. 
We have 


[esine + 327) de = 7 [x sin(2 + 32”) dx 


du 
1 a 
=} f sinu ae 
=} fsinudu 


= —cosu +c¢ 
= —4 cos(2 + 3x?) +c, 


substituting at the end for u in terms of z. 


Notice in this example that the integrand is a composite function, 
sin(2 + 327), multiplied by (in effect) the derivative of the ‘inner function’ 
2+ 327. We are only ‘out’ by a constant factor (of é), and this shows up in 
the final expression. In such a situation it is relatively easy to see that the 
substitution u = 2+ 3x? will help to simplify the integral. There are other 
useful types of substitution that are much less apparent, but we need not 
discuss those here. 


One form of integral comes up sufficiently often to be worth separate men- 
tion. We have, for g(x) 4 0, 


/ g(x) dx = In|g(x)| +c. (6.5) 
g(x) 


This follows from treating the cases g(x) > 0 and g(x) < 0 separately, and 
using the substitutions u = g(x) and u = —g(x), respectively. When 
g(x) > 0, we have u = g(x) > 0, du/dx = g'(x) and 


ok 2 deena c= In(g(x)) +¢ 
[ob u- [oa Inu+c=In(g(xz)) +. 


When g(x) < 0, we have u = — 


> 0, du/dx = —g'(x) and 


g(x) 
C3 eee (ese AC ve a ee ee 
[Fee | sa =| 54 use In(—g(x)) +e. 


*Exercise 6.7 


Find the following integrals. 


(a) fv exp(2 + 4y®) dy (b) J cosusin? yay 


(c) [ivi-eat 42724) (@) | _ 


1+ 2? 


©) {ae © few ween 


1+sin?t 
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In (6.4), we are taking 
g(x) = 2+ 327. 


Now g/ (x) = 62. 


Note how, in effect, a dx is 
xv 
replaced by du. 


Part (e) requires use of the 
trigonometric formula for 
sin 2t derived in 

Exercise 3.3(a) (i). 


Integration by parts 


The formula for integration by parts is 


| feos'e) ax = Flo)gle) — f Fe) g(a) ae. 


As with integration by substitution, this formula transforms an integral into 
a different one, and the key to successful use is to ensure that the ‘new’ 
integral is easier to evaluate than the original. 


Example 6.3 
Find poor dx. 


Solution 
In the formula, take f(z) =z and g/(x) =e~?”. Then f’(x) =1 and 
g(x) = —Ze~?", so 


[oe de = x a) jo se) dx 
Sotge at | 6" de 
2 2 
ol _ 1,-22 
= —5xe qe +e 


=-72r+lje*+c. Ef 


*Exercise 6.8 


(a) Use integration by parts to find pe dz. 


(b) Find [eer dx. (Use integration by parts, then the result of part (a).) 


6.4 Definite integrals 


Diagnostic test 6.4 


Calculate each of the following integrals. 


1 2 2 
3 3 3 
(a) fe _2)dx — (b) [e -2)dx  (c) fe ~aias 


Now check your solutions against those given below. If you are happy with 
the answers, you may proceed directly to Section 7. 


Solution 
(a) We have 


1 
/ (x? —2)dr= [52" — 22], 


(b) Similarly, 


Section 6 Integration 


Note that an arbitrary 
constant need not be included 
in the expression for g(x). 
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Unit 1 Getting started 


(c) We can either evaluate the integral directly, 


The indefinite integral is a function (or, to be exact, a family of functions 
containing an arbitrary constant). A different, though closely related, form 
of integral is the definite integral, whose value is a number. If we know 
an integral, say F’, of f, then a definite integral of f is readily found, since 
the value of a definite integral is given by 


b 
7 f(x) dx = F(b) — F(a). 


The difference F'(b) — F(a) is commonly written as [F (x)]°. 


So, for example, 


dé = [arcsin (40)], = arcsin 5 — arcsin0 = e -0= ne 


1 
1 
[ V4 — 6 
Exercise 6.9 


3/24 
Evaluate / Gia 


A rough-and-ready way of thinking of a definite integral i f(x) dx is as ‘the 
accumulation of the values taken by f(x) as x runs from a to b’. This ties 
up with a useful way of visualizing definite integrals, as areas. If f(x) > 0 
for a < x < b, then the definite integral (ip f(a) dx is equal to the area under 
the graph of f(x) between x = a and x = b (see Figure 6.1(a)). There is one 
point to be careful about here. If f(x) < 0, corresponding to a region below 
the x-axis, then we have a negative contribution to the integral, whereas 
area is always a positive quantity. Thus for a function f as pictured in 
Figure 6.1(b), i f(x) dx = area A; — area Ap. 


YA 


y=f@) 


av 


(a) (b) 


Figure 6.1 ie f(a) dx as an area: (a) with f(a) > 0; (b) in general. 
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Any choice of F’, an integral 
of f, leads to the same value 
for F(b) — F(a). 


Whereas indefinite integrals typically arise in solving differential equations 
(‘reversing differentiation’), definite integrals can arise in other circum- 
stances too, such as when an integral is seen as the limit of a sequence 
of sums. As an example, consider the following model, to estimate the 
number of seabird nests on an island. 


Example 6.4 


An island is modelled as a circle of radius 500 metres. The density of nests 
is greatest on the edge of the island, and least at the centre. (The birds 
prefer ready access to the sea.) The density is modelled as D, measured in 
nests per square metre, where 


D=0.1+ r/500 = 0.14 0.002r, 
where r is the distance from the centre of the island, measured in metres. 
Estimate the number of nests on the island. 
Solution 


Imagine the island divided by concentric circles into narrow ‘annular’ strips, 
each of width dr. Figure 6.2 shows a ‘typical’ strip, between circles of radius 
rand r+ or. 


The area of this typical strip is approximately 2rrdr (‘length x width’). The 
number of nests within this strip is (the area) x (the density of nests), and 
so is approximately 


2arérD = 2ar(0.1 + 0.002r) dr. 


The total number of nests on the island is the sum of the number of nests 
in all the strips. If we take the limit of this sum as dr — 0, then the sum 
converges to the definite integral of 27r(0.1 + 0.002r) between r = 0 (the 
centre of the island) and r = 500 (at the edge). That is, an estimate of the 
number of nests is 


500 
i Qnr(0.1 + 0.002r) dr = 6.021 x 10°, 
0 


so there are approximately 600000 nests. 


The equivalence of the two views of integration, as ‘reversing differentiation’ 
and as the limit of a sequence of sums, is assured by the Fundamental 
Theorem of Calculus, but that need not concern us here. Many texts use 
the limit of a sequence of sums (rather than ‘reversing differentiation’) as 
the basis for the definition of integration. 


End-of-section Exercise 


Exercise 6.10 


Find the following integrals (where a is a constant). 


500 
(a) / 2nr(0.1 + 0.002r) dr (b) [ext — 2y) dy 


» 4 ee | 
©) f saree (a) I. nO) [olor a) at 


2 sin 2x 
f t 3t dt ——____—_— 
(f) 7 cos(3t + a) @ f te 


Figure 6.2 


Section 6 Integration 
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Unit 1 Getting started 


7 Computer activities 


This section demonstrates how the computer algebra package for the course 
can assist in performing various tasks that have been studied in the unit. In 
particular, we can easily perform algebraic manipulations, such as simpli- 
fying expressions and solving quadratic equations. We can also solve equa- 
tions numerically when an algebraic solution cannot be found, and handle 
the arithmetic of complex numbers. Turning to calculus, you will see how 
to produce graphs, and how to differentiate and integrate. 


Use your computer to complete the following activities. 


Symbolic algebra on the computer 


Activity 7.1 
(a) Evaluate ay 
(b) Simplify (ayn, 
(c) Simplify each of the following. 
(i) aa? (ii) exp(2Inz + In(a# + 1)) (iii) Vx? 
(d) Try ‘simplifying’ 2° + 2”. 


Activity 7.2 


Expand each of the following. 
(a) (cv +2)? ~~ (b) (2243)? 


Activity 7.3 


Suppose that Iny = 2.83ln 2 + 0.37. Express y as a function of 2. 


Activity 7.4 
(a) Solve for x the following equations. 

(i) 227+72-4=0 (ii) c?+ae2-6=0 (iii) z?-22+2=0 
(b) Solve for x the quadratic equation abr? — (a+ b)x +1=0. 


Solving equations numerically on the computer 


Activity 7.5 


Find the values of x that satisfy the equation cos x = 0.22. 


Complex numbers on the computer 


Activity 7.6 
Evaluate the following. 
(a) (1+ %)(1 — 2%) (b) |44+3| 
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The solutions to these 
questions, and further 
comments, are given in the 
computer worksheets. 


Activity 7.7 


(a) Expand (cosx + isin z)*. Find Re((cos z+ isinx)*), and hence use De 
Moivre’s Theorem to obtain an expression for cos 4x in terms of sinx 
and cos x. 


(b) Expand cos 4”. Compare the result with your answer to part (a). 


Graphs and differentiation on the computer 


Activity 7.8 
Consider the function 

y(x) =5—2(x+1)exp(—gx) («> 0). 
(a) Obtain a graph of y(x) against x. 


(b) How does y(x) appear to behave as x becomes large? 


(c) Can you find the global maximum and global minimum values taken by 
y(a)? 


Activity 7.9 


d 
(a) Find a where y = 1 — 0.9exp(—0.5z). 


wg GY af 
(b) Find dn’ where y = ar 
2 


d 
(c) Find ae where y = Int. 


Integration on the computer 


Activity 7.10 
(a) Find the following integrals. 


(i) / speed ; cma 


t 
(b) Try to find the following integrals. 


(i) /o pe (ii) /—me 


Activity 7.11 


Evaluate the following definite integrals, first numerically and then symbol- 
ically. 


(a) fie (b) / er 


Activity 7.12 


Evaluate the following. 


Bl 
(a) | 7 tt symbolically (b) / 


=1 


2 
— dz, numerically 
x 


Section 7 Computer activities 


You may wish to refer back to 
Exercise 5.10 in answering 
this. 


The functions in parts (a) 
and (c) were also considered 
in Exercise 5.3. 
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Outcomes 


After studying this unit you should be able to do the following. 
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Interpret the following notation: scientific notation, Z, R, C, [a,b], (a, }], 
[a, b), (a,b), |x|, f(x), exp(x), e”, Inz, sina, cos xz, tan z, sec x, cosec 2, 
cot x, arccos x, arcsin x, arctan x, i (= /—1), Re(z), Im(z), Z, |z|, e* (for 


2 mn 
z in C), (7,6), him f(b), F(a), #"(2), F(a), B, TY, TY, a, 800, 


ge” deem” 
[ f(x) de, J? f(@) de, (F(a). 
Interpret the following terminology: integer, real number, interval, deci- 
mal places, significant figures, rounding; variable, dependent variable, in- 
dependent variable, parameter; function, domain, image set, codomain; 
linear function, quadratic function, polynomial function (and root of a 
polynomial equation); exponential function, logarithm function, power 
function, trigonometric function; composite function; complex number, 
complex conjugate, real and imaginary parts (of a complex number), 
modulus and argument (of a complex number); polar coordinates, polar 
form and exponential form (of a complex number), De Moivre’s Theo- 
rem, Euler’s formula; differentiation, derivative, Leibniz notation, Chain 
Rule (for differentiation), implicit differentiation, higher derivative; gra- 
dient of a function, stationary point, local maximum, local minimum, 
global maximum, global minimum, derivative of a complex-valued func- 
tion; continuous function; integration, integral, integrand, indefinite in- 
tegral, definite integral, arbitrary constant. 
Use the formulae for: the solution of a quadratic equation; the alge- 
braic properties of indices (and the exponential function); the algebraic 
properties of logarithm functions; various trigonometric identities; mul- 
tiplying complex numbers in polar form; finding powers in polar form; 
derivatives of standard functions; differentiating products, quotients and 
composite functions; standard integrals; integration by parts and by sub- 
stitution. 
Solve two simultaneous linear equations by Gaussian elimination. 
Factorize quadratic functions. 
Sketch the graphs of linear, quadratic, exponential and trigonometric 
functions. 
Sketch the graphs of more general functions, including identifying sta- 
tionary points and asymptotes. 
Use log-linear plots to recognize relationships of the form 
y= Ae™. 
Use log-log plots to recognize relationships of the form y = Az?. 
Add, subtract, multiply and divide complex numbers, and move between 
Cartesian, polar and exponential forms of a complex number. 
Find derived functions, using the table of standard derivatives and the 
rules for differentiating various types of combination of functions. 
Find indefinite integrals, using the table of standard integrals and (in 
simple cases) the rules for integration by substitution and for integration 
by parts. 
Find definite integrals (of suitable functions). 
Use the computer to: simplify and expand algebraic expressions, graph 
a function, solve an equation (numerically or symbolically), find deriva- 
tives, find integrals. 


The formulae are all given in 
the Handbook. 


Solutions to the exercises 


Section 1 


1.1 (a) (i) 6.48235 x 10* (Count the number of 

places the decimal point is moved to the left to find 

the power of 10.) 

(ii) 73 x10 

(b) (i) y lies between 127.683 — 0.006 = 127.677 and 

127.683 + 0.006 = 127.689; that is, in the interval 
(127.677, 127.689]. 

(ii) We cannot be certain of the fifth significant figure 

(it could be either 7 or 8), so we can only give y as 127.7 

(to four significant figures). 

(c) The condition |” — 2.763] <5 x 10~* is equivalent 

to —5 x 10-4 < x — 2.763 <5 x 1074. 

So « <2.763+5 x 10-* = 2.7635 

and x > 2.763 —5 x 1074 = 2.7625. 

So 2.7625 < x < 2.7635; that is, x lies in the interval 
(2.7625, 2.7635). 


Since 2.7635 does not lie in this interval, we can be 
certain that « = 2.763 to three decimal places. 


Section 2 


2.1 Because the reservoir is initially at 60% capacity, 

Vo = 0.6C. 

The volume V will have reduced to 20% of C when 
0.2C =0.6C —10t, i.e. when 0.4C = 10t. 

Thus crisis measures will be needed when t = 0.04C. 


(Note that, because of the domain conditions, this 
solution is valid only if 0.04C < 72000, i.e. for 
C < 1800000.) 


2.2 (a) We need 
2000 = 5(—3600) + c = —18000+ «. 
Hence c = 2000 + 18000 = 20 000. 
(b) (i) The cutter catches the boat when X = Y, ice. 
when 
7t = 5t + 20000. 
This gives 2t = 20000, so t = 10000. 
10000 seconds is 2 hours, 46 minutes and 40 seconds. 
So the cutter catches the boat at around 2.50am. 


(ii) At t = 10000, both X and Y are equal to 70000. 
So the cutter catches the boat 70km from A, which is 
inside territorial waters. 


2.3 Multiplying the first equation by 3 gives 


15,,__ 57 
3u- sus >. 


Subtracting this from the second equation gives 
(4+ 2)v =—-29- 2, 


i.e. By = —1, so y = —18 — _5, 


Solutions to the exercises 


Substituting this into the first equation gives 
2u — 5(—5) = 19, 
so u = (19 — 25)/2 = —3. 
Thus the solution is u = —3, v = —5. 
(It is good practice to check solutions where you can, 


and this is easily done here. With u = —3 and v = —5, 
we have 


2u — 5v = 2(—3) — 5(—5) = -6+ 25 = 19, 

3u + 4v = 3(—3) + 4(—5) = —9 — 20 = —29, 
so these values of u and v do satisfy the given equa- 
tions.) 


2.4 (a) Using formula (2.8), we obtain 


—-7T+ J7?-4 x 2x (—4) 
a 
2x2 
_ —7+ V/49 + 32 
7 4 
—-7+9 
ieee ins - or —4,. 
(Again, these solutions can be checked by substitution 
into the equation. For example, with « = —4, 


Qe? + 7 —4 = 32-28 -4=0, 
as required.) 
(b) We obtain x = —3 or 2. 


25 c= —2k + \/4k? — 4m(mw?) 
2m 

= —2k+4 2V k2 = m2w2 

— 2m 

fy vi mie 

— om m 

— kV kR = m?2w? 

=e Tee 

hy [Rami 

— mM m2 

eee a ey eee 
m m? 


Putting K = k/m, this gives « = —K + VK? — w?, as 
required. 

26\(2) oreo" =2 

(b) @®/e° =a? =a? (or 1/a’) 

(c) (a3)5 = a3%5 = ald 

(d) (271)4 x 43 = 27-4 x (22)8 = 2-4 x 26 = 9? =4 
(e) 8-1/3 = 1/81/83 = 1/48 = 3 

16S 06" a7) =o Ss 


@) (9° =(/8) =@=4 


(h) (1624)1/? = 164/?(a4)4/? = VTGa4*/? = 4? 
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2.7 (a) n7+m4—Inl4=In(7 x 4+ 14) =In2 
(b) Ina +2Inb— In(a?b) = Ina + In(b”) — In(a”b) 
= In(a x b” + (ab)) 
= In(b/a) 


(c) e* x (e4)? +e?” = e* x e*Y + e*™* 
xet+2Qy—22x 


(or Inb— Ina) 
=e 2y-x£ 
(d) In(e* x e¥) = In(e*t¥) =a+4+y 
(Alternatively, In(e” x e”) = In(e”) + In(e¥) = a+ y.) 
To simplify the expression in part (d), we first rear- 
ranged it in the form In(e%?”*"""9), which just equals 
something. 


=e 


In parts (e)-(g), we first rearrange the expression as 
eln(something) which also just equals something. 


(e) e2ing — eln(2”) = 72 

(f) e72ing eln(x~*) = 7-2 (or 1/2?) 

(g) exp(2Ina + In(x + 1)) = exp(In(2? x (2 +1))) 
= 27(2+1) 

(Alternatively, exp(2 Ina + In(x + 1)) 

= exp(In(x?)) x exp(In(x + 1)) = #?(a + 1).) 


2.8 Taking logs of a® = e** gives 
In(a”) = In(e**), 


This holds for all x so long as 


ie. tlna=kz. 
k=Ina. 


2.9 (a) Taking exponentials of each side of 
Iny = 2.83ln az + 0.387, 
we obtain 
exp(In y) = exp(2.831n a + 0.37), 
ie. y = exp(In(z-89)) x exp(0.37) = 1.42788. 
(b) If ny =alna +), then taking exponentials gives 
exp(In y) = exp(alnz + b), 
ie. y = exp(In(x*)) x e° = ea* = ca*, where c= e 
a positive constant. 


> is 


2.10 (a) (i) f(g(a)) = f(l— 2%) 


=e (-2") = orl 


(ii) g(f(x)) = g(e*) =1-(e°*)? =1-e 

(b) We can obtain h(a) in two steps. 

Step 1 Calculate y = 4+ 927. 

Step 2 Apply y~* to the result of Step 1. 

Then h(x) = g(f(x)), where g(x) = x~* and f(z) = 
4+ 927. (We need to exclude x = 0 from the domain 
of g, but f(a) is never equal to 0, so this is not a prob- 
lem.) 
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Section 3 


3.1 (a) The figure shows a circle of radius 1, and radii 
rotated through 0 (OA) and } (OB). We see from the 
figure that A has coordinates (1,0) = (cos0,sin0), and 
B has coordinates (0,1) = (cos $,sin $). Therefore 


sn0=0, cosO=1, sint=1, cos>=0. 
Ya 
1|B 
A 
1 «x 


(b) Since sin0 = 0, cosecO and cot 0 are not defined. 
We have 


tan0=0 and 


Since cos 5 = 0, tan } and sec 5 are not defined. 
We have 


cot 5 =0 and 


(c) We obtain: 


secQ = 1. 


cosec 5 = 1. 


inz—1 mz v3 wl 
sng= 5, cosog=y, tan § = vq: 
cosecZ = 2, sec e= Wet cot § = V/3; 
sin t = Ja cos T = Ty; tan} =1, 


cosec F = V2, sec 7 = V2, cot F = 1; 
sin= = > cos = = $, tan 5 = V3, 

WR ol 
cot 3 V3" 
(d) siné = 0 if 6=0, but also if @ differs from 0 by 
27, +47, +67, and so on. Also, siné = 0 if d= 7, or 6 
differs from a by a multiple of 27. In general, sin #@ = 0 
if 9 = nz, where n € Z. 


see HL sec Z = 
cosec 3 = Yq, sec 3 = 2, 


3.2 (a) One solution is arcsin 0.8 = 0.93 (to two dec- 
imal places). Looking at the graph of y = sin@ below, 
we see that it is symmetric about 6 = 5. So there is 
also a solution of sin = 0.8 at 6 = 7 — 0.93 = 2.21 (to 
two decimal places). These are the only solutions with 
6 between 0 and 27. (If @ is between m and 27, sin @ is 
negative.) The other solutions are obtained by adding 
multiples of 27 to these two. The solutions in the re- 
quired range are (to two decimal places): 


0.93, 2.21, 7.21, 8.50, 13.49, 14.78. 


Wy 


0.93 £n-0.93 T 


(b) We saw in Exercise 3.1(c) that tan = 1,s0@= 7 


is one solution. We can see from the graph of tan (Fig- 


ure 3.5) that there is one solution of this equation in the 


range —5 to 5, and that other solutions are obtained 


by adding multiples of 7 to this. We can express the 
full set of solutions as 


T 
qb nn, 


where n € Z. 


3.3 (a) (i) sin26 = sin(6 + 6) 
= sin 0 cos @ + cos @sin 6 
= 2sin 0 cos 6 
(ii) cos 20 = cos(6 + 0) = cos @cos 6 — sin 6 sind 
= cos” 6 — sin? 6 
(b) (i) sin(2x — 6) = sin 27 cos 6 — cos 27sin 0 
= 0x cos@—1 x sin? = —sin@ 
(ii) cos(2m — 6) = cos2mcos 6 + sin 27 sin 0 
=1xcos#+0 x sin@ = cosé 
(iii) sin(z — 0) = sina cos 0 — cos7sin0 
= 0 x cos — (1) x sin#d = sind 
(iv) cos(a — 0) = cosmcos 6+ sin msin 0 
=(-1) x cos6+0 x sin# = —cos0 
(v) sin(} — @) = sin § cos@ — cos § sin 
= 1x cos#?—0 x sin@ = cosé 
(vi) cos(} — 0) = cos § cos@ + sin § sin 
= 0x cos#+1 x sin@ = sind 


For 0 < @< $, the results of parts (v) and (vi) can be 
confirmed by examination of a right-angled triangle. 
(vii) cos (37 + x) = cos 8 cosa — sin 3 sin x 


= 0 x cosa — (—1) x sina =sinz 


Section 4 


4.1 (a) V=3+4i 
(b) pl = VF FE =5 
(c) v-w=1-31 
(d) vw = (3 —- 47)(2 


i) =6—8i- 31+ 47? =2-11i 


Solutions to the exercises 


w wo (2—i)(34 4%) : 
(e) v |ol2 25 =i tgi= ot 5 
1 Ww 2+% 9. i. 
(g) w? = (2-i)(2-i) =3- 44 
(h) 2w — 3v = 4— 22 — (9 — 127) = —5 + 10% 
4.2 We obtain 
g= Et E St $i. 
4.3 
Ya 
(0,4)¢@ 
(-V3,)) (1,1) 
ind E a e > 
(-2,0) (4,0) * 
(1-1) 4 
(a) (2,7) 
(b) (V2, 4) 
(c) (/2,—3) 
(d) (4,0) 
(e) (4,4) 


(f) This is (2,7— 9), where ¢ is as shown in the fig- 
ure. Now tang¢ = Wet so ¢ = % (see Solution 3.1(c)). 


So (—V3, 1) has polar coordinates (2, 5). 
4.4 2=2(cos(-4) + isin (-4)) =2(45- Si) 
= /2(1-7) 


4.5 This solution is identical to the solution to Exer- 
cise 4.3. 


4.6 First express 1—i in polar form, as (V2,—4). 
Then 
(2, — 9)? = (0/2), —¥E) = (27°, -57) 
= (1024, 7), 
adding 67 to the argument to obtain its principal value. 
Returning to Cartesian form, we obtain 


(1 — i)? = 1024(cos 7 + isin r) = —1024. 


4.7 z=re"*, so 
Re(ze**) = Re(re’?e™*) 
= Re(rei(*+9)) 


r cos(wt + 6). 
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Section 5 


5.1 (a) x(t) = —21sin(3t + 2), 
&(t) = —63 cos(3t + 2). 
(b) Note that 7 cos(3t + 2) = x(t) — 5, so 
—63 cos(3t + 2) = —9(a(t) — 5). 
Hence we have 4(t) = —9(a(t) — 5), or equivalently 
&(t) + 9a(t) = 45. 


5.2 The rate at which the wage bill will be rising is 
given by 
dB 


“a 10°(0.04) exp(0.04t) = 4000e°-*. 


As a fraction of the future wage bill, B, the rate of rise 
dB /dt is 
1 dB 10°(0.04) exp(0.04¢) 
Bdt 10° exp(0.04t) 
So the rate of rise is 4% per year. 


= 0.04. 


5.3 (a) a = (—0.9)(—0.5) exp(—0.5z) 
= 0.45 exp(—0.52). 
(b) F’(x) = 122? — 4, so putting x = 2 gives 
PQ) =12x2=4 = 92. 
1 


a dt2 ot? 


F'(2) = 6sec tan Z — 12sin (—) = 12V3 + 12. 
(e) g(t) = —3asin(3t + d) + 3bcos(3t + d), and then 
g(t) = —9acos(3t + ¢) — 9bsin(3t + ¢), so 

g (0) = —9acos ¢ — 9bsin ¢. 


5.4 (a) v'(z) = 3sec? z —2sinz. 


(b) au = 3Acos 3t — 3Bsin 3t. 

The value of this at t= 75 is 
3Acos = — 3Bsin 5 = 3A/V2 — 3B/V2 

— alA — B). 

(c) We need to find the fourth derivative: 
f'(t) =6cos3t, f”(t) = —18sin 3t, 
f(t) = —54cos3t, f(t) = 162sin3¢. 

Since sin 37 = —1, we obtain f@(Z) = —162. 

3 
(d) f= 1+ oy?" 
dz c dz 
(e) dx ce+d’” du 


= 5 when x =0. 
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5.5 (a) This is a quotient. We obtain 
dy — (=)(x? + 1) — (Inz)(2z) 


dx (a? 21)? 


a+a-!—2¢lnex 
(+P 


lag 
(b) f’(t) = 5t*n(3t + 4) + emcee 
(c) g(t) = Asin(At+ C) + (At+ B)Acos(At + C), so 
g' (0) = AsinC + ABcosC. 
(d) For a(t) = e~** sin 4t, we want «(t) and #(t). Us- 
ing the Product Rule: 
£(t) = —3e~** sin 4t + e3*(4 cos 4t) 
= e **(4cos 4t — 3sin 4t). 
Using the Product Rule again: 
#(t) = —3e~**(4cos 4t — 3 sin 4t) 
+ e~**(—16 sin 4t — 12 cos 4t) 
= e *(—7sin 4t — 24cos 4t). 


5.6 In each case, we suggest an intermediate variable, 
u, but (except in part (a)) omit details. 
(a) Setting u = ¢?, we obtain y = e”, dy/du = e“ and 
du/dt = 2t, so 
aa 
dt  dudt 
(b) Setting u = 3x3 + 4, we obtain 
f' (x) = 6(3a3 + 4)°(9x?) = 5427(3x3 4 4)5. 


(c) Setting u = 3v + 4, we obtain 


e“2t = 2texp(t). 


< = 3sec?(3u + 4). 


(d) Setting u = 4 — 2, we obtain 


'(z) = 4(4 — 2?)-/?(-2z) = a 
e) Setting u = 1+ 227, we have 
(e) é 
f(z) = (1+ 22”) 80? yy 8/2 
(see Example 2.3), so 
—5/2 —6x 
f(a) =—3 (1+ 207)? (42) = ———_.. 
: (VI + 222)” 
5.7 (a) This is a composite function, with 
y = secu, U= aoe 


Here wu is a quotient, and 
du 1(a +1) — 2(2z) 1-2? 


dx (ae 1) (x2 +1)?" 
Then, using the Chain Rule, 

dy _dydu 

dx  dudx 


1 
= secutanu 
oe 


(b) This is a product, but the second part of the prod- 
uct is a composite function. If v = exp(t? + 1), then 

d 
a = 3#” exp(t® +1) 
(using the intermediate function u = t? + 1). Then, us- 
ing the Product Rule, 

d 
= = 2texp(t? + 1) + t?(3¢? exp(t? + 1)) 

= (3t* + 2#) exp(¢* + 1). 


5.8 (a) (i) The Product Rule gives 


dia 2 dy 
eae = = 
Tn (ay) = 2ry +x Ae 
(ii) The Composite Rule gives 


dy 
——s 3 = 205 
dn mail dx’ 
(b) Using implicit differentiation, we obtain 


dy dy 
Bs Sai te 2 Cee 2: 
3a" + Qry eee en 
When x = —1 and y = 1, this gives 
dy | dy 
3-24 —+3— =0. 
7 dx v dx 
Hence dy/dx = —+, and the required gradient is —. 


5.9 We have f’(t) = —2sin 2t + 2i cos 2t. 
Then f”(t) = —4 cos 2t — 4i sin 2t. 


5.10 To test for stationary points, use the Product 
Rule to find 


y' (x) = —2e~2* + 4(2(a + 1))e72* = (w — le" ?*. 
So y’(x) = 0 only at « = 1. The derivative is negative if 


x <1and positive if x > 1, so this is a local minimum. 
We have y(1) = 2.574 (to three decimal places). 


5.11 (a) f(v) =0 only when v = 0. 
The denominator 4 + 1.5v + 0.008v? is positive for all 
v > 0, so f(v) is defined for all v > 0. 
As v—> oo, f(v) > 0. 
To find any stationary points, differentiate f(v) using 
the Quotient Rule, to obtain 
fv) = (4+ 1.5u + 0.008v) — v(1.5 + 0.016v) 
(4+ 1.5v + 0.008v?)? 
4 — 0.0080? 
(4 + 1.5v + 0.008v?)? ° 
This is 0 when v = +500 = +22.36 (to two decimal 
places). 


The negative stationary point is outside the domain 
(v > 0), so we need consider only v = 22.36. For 
uv < 22.36, f’(v) > 0, while for v > 22.36, f’(v) < 0. 
Therefore v = 22.36 is a local maximum. We have 
f (22.36) = 0.538 (to three decimal places). 


A sketch graph of f is shown below. 


Solutions to the exercises 


(b) From the graph we see that the global maximum 
of f occurs at the local maximum, i.e. v = 500. The 
global minimum occurs at the endpoint of the domain, 
Le. vu = 0. 


Section 6 


In all solutions for this section, c is an arbitrary con- 
stant. 


6.1 (a) Using the Product Rule for derivatives, 


d x 1. 
cos az 4 3 sin ax + Cc 
dx a a 
1 £ . a 
= ——cosaz + (—— ) (—asinazx) + — cosax 
a a a 
= xsinax. 
Therefore 
; x I 
xsinax dx = ——cosax + — sinax + ¢, 
a a 


so verifying the given integral. 


(b) Using the Composite Rule for derivatives, 


d 1 
— (-= In(cos ax) + c) 
dx a 


_ 1 HE a ae 
7 a) cosax dx pee 


1\ —asinax 
7 a COS ax 
= tanaz, 


so verifying the given integral. 


6.2 (a) Ifa =tany, then, differentiating with respect 
to x, we obtain 
d d dy 9 dy 
1 = it = — t —$. = 5 
a any) dy any) dz 7 8° Yan 
Then, using the formula sec? y = 1+ tan? y, and the 
fact that x = tan y, we obtain 


d d 
1 = (1+ tan? y) a = (1+ 27) a 
dx dx 
Therefore 
dy 1 
dx 1+22° 


Now, if = tany, then y = arctan 2, so 


1 
—~ dx = arctanz +c. 
1+ 2? 
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(b) If z =siny, then, differentiating with respect to 2, 
1 = cos dy 
8 F ae 


Using sin” y + cos? y = 1, and the fact that cosy > 0 


because —5 < y < 5, we have 


d / d 
cosy =vl sin?y # 
d 
— Vi-w = (since siny = 2). 
x 


Therefore 
dy _ 1 
dx J/1l—z?" 


Now, if  =siny, then y = arcsin x, so 
/ —_ in + 
>. 42 = _ arcsin & c. 

V1 — 2? 


This corresponds with the Handbook entry for 


a with a= 1. 
Jae — 72 ; 


6.3 We have 
[eos ax dx = } fo + cos 2ax) dx 


dole 


(2+ 5, sin2ar) +e 
2a 

1 L 2 

= gut An eee 

6.4 (a) 46°" +c 

(b) 2tan3t+c 


(c) This does not quite match a Handbook entry as it 
stands. We have 36 + 4x2? = 4(9 + 2”), so 


1 1 
ey, See ee 
| eam a cess " 


II 
Ale 
© 
+} ya 
Le] 

Q 

S 


; v 
= jp arctan (5) + ¢. 
(d) For y < 3, the integrand is positive, and we have 
1 
dy = —$1 2y) +e. 
l= y = —3n(3 — 2y) +e 


(e) For y > 3, the integrand is negative, and we have 


1 
d = 
cay a 


= —}In(2y—3)+c. 


+ In(—3 + 2y) +¢ 


6.5 (a) —3sin(—2t) —2cos4t+c. Note that you 
can use cos @ = cos(—6) to simplify the integrand (or 
use sin? = —sin(—6) to simplify the result) to obtain 
3 sin 2t — 2cos4t +c. 


(b) arcsin(t/3) +c 


(c) [=a [ (se+2) dt 


= 3t°+7In(-t)+e (t<0). 
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(d) Fort >0, 
/ (21mtan = 7) dt = 2t(In(4t) — 1) —2Int +c. 


(e) Choose a = 1, b = —1 in the standard integral, and 
note that we are in the case -1 <a <1: 


cesyezs) dx = 5!n (=) 46, 


6.6 (a) Using v? —k? = (v—k)(v+k), and tak- 
ing a=k and b= ~—k in the Handbook entry for 


/ = dx, we obtain 
| a—e®- | oer” 


1 k-v 
= 3m (3) te (-—k<u<k). 
(b) Since a — bv? = —b(v? — a/b), we can use part (a) 


with k = \/a/b to obtain (for —,/a/b < v < \/a/b) 
1 1 1 
lmee-s/ ea” 
al al In a/b—v ts 
= (ayer) v4 Japs 


4 In va—vvb LC 
2V/ab vvb+ Va _ 


6.7 (a) Ifu=2+ 4y?, then du/dy = 12y?. So 


fv exp(2 + 4y*) dy = 4 | 12y? exp(2 + 4y”) dy 


= J. exp(2 + 4y?) + ¢. 
(b) If u=siny, then du/dy = cosy. So 
fecosusin® ydy = fw? du = sue +c= sin ytc. 
(c) If u=1-—¢?, then du/dt = —2t. So 


[eviqPa=-3 f -ev/t—Fat 


=-} fv? au 


=-3Gu0?) +6 
= —3(v 1 _ t?)3 +c. 
(d) Ifu=1+4 2?, then du/dx = 22x. So 


x 1 24 1 2 
[Ee t/a a a 


since the integrand is of the form g/(a)/g(a) with 
g(x) > 0. 


(e) Ifu=1+4sin7t, then du/dt = 2sintcost = sin 2t, 
using the trigonometric formula for sin2t. So the in- 
tegrand is of the form g/(x)/g(x) with g(x) > 0, and 
hence 


in 2t 
| ep een tin?) +0 
1+sin*¢ 


(f) Using Equation (6.5) with g(y) = 1—y?, so that 
g'(y) = —2y, we have (for y 4 +1) 


y el —2y i 2), 
[ee bf oa 5mm|l—y|+e. 


6.8 (a) Take f(x) = and g'(x) =e~*, so f’(x) =1 
and g(x) = —e~*. Then 


jae a ae fre) ae 
= xe 7 + oz dx 


=-re*—-e *+c 
=—-(#+1)e* +e. 
(b) Take f(x) = 2? and g’(z) = e-*, so f’(x) = 2a and 
g(x) = —e~*. Then 
pee dz = x*(—e-*) [x e ”)dx 
= ge F + 2 f xe dx 


=—2'¢ 7 + 2(—(@ + Ie * + 8) 
(by part (a), where 6 is an arbitrary constant) 

= —(a74+2r+2)e"* +e 
(where c = 20 is an arbitrary constant). 


3/2 1 3/2 1 
6.9 / ——_{dz= 1/ > dz 
0 69 +422 4Jo 342? 


fi 2\ 13 
=A 3 arctan 3 
2 2 0 


(arctan 1 — arctan 0) 


1 
6 
=}(3-0=% 


500 
2rr(0.1 + 0.002r) dr 


7 ; r3 500 
= 7 |0.1r* + 0.004 
3/ Jo 


5003 
=T (01 x 5007 + 0.004 x ] 


6.10 (a) : 


= 6.0 x 10° (to two significant figures). 


Solutions to the exercises 


(b) Since exp(a — 2y) = e*e~ 74, we have 


[ext — 2y) dy = e* / e Y dy 


-2 
= see Me 


= —Jexp(a—2y) +e. 
(The same result can be obtained using integration by 
substitution with u = a — 2y.) 


(c) For 0 <u <1, we have 3u+5 > 0, so 


1 
1 
| du = 
0 3u+ 5 


= 5(In8 —In5) = $n. 


(d) For —3 < u < —2, we have 3u+5 <0, so 


— 


+In(3u + 5)| 


—2 


-2 
u 1 

J rege (bins 5)] 3 

= #(n1—In4) =—-jln4. 


(e) Substitute u = 3t +a, so du/dt = 3. Then 
[ines a) i= 1 [ sin(ae+a) dt 


=  [ nudu 


= 7u(In |u| —1)+c¢ 
= 5(3¢ + a)(In|3¢ + a] —1) +6. 
(f) Use integration by parts with f(t) = t and g’(t) = 
cos(3t + a), so f’(t) =1 and g(t) = ¥sin(3t +a). Then 
[ teos(st +a) dt 
= t(§ sin(3t + a)) — / 3 sin(3t + a) dt 


= ftsin(3t +a) 4 
(g) We have 


cos(3t + a) +c. 


1 
9 


d 
— (3+ cos 2x”) = —2sin 2a 
dx 


and 3+ cos 22 > 0 for all x, so 
. sin 2x7 ies sf —2sin 2x 
9 3+ cos 2x 9 3+ cos 2x 
= —$[In(3 + cos 2x)\¢ 


= —}(In3—1n4) 


= 1443-1 )],4 
= ging = ging. 
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UNIT 2 First-order differential equations 


Study guide for Unit 2 


This unit introduces the topic of differential equations. It is an important 
field of study, and several subsequent units are also devoted to it. There are 
many applications of differential equations throughout the course. 


The subject is developed without assuming that you have come across it 
before, but the unit assumes that you have previously had a basic grounding 
in calculus. In particular, you will need to have a good grasp of the basic 
rules for differentiation and integration. (These were revised in Unit 1 of 
this course.) 


From the point of view of later studies, Sections 3 and 4 contain the most 
important material. 


The recommended study pattern is to study one section per study session, 
and to study the sections in the order in which they appear. 


You will need the computer algebra package for the course for Subsection 2.3 
and for all of Section 5. The computer work for Subsection 2.3 may be 
postponed until later (for example until your study of Section 5) without 
affecting your ability to study the subsequent sections. 
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Introduction 


An important class of the equations that arise in mathematics consists of 
those that feature the rates of change of one or more variables with respect to 
one or more others. These rates of change are expressed mathematically by 
derivatives, and the corresponding equations are called differential equations. 
Equations of this type crop up in a wide variety of situations. They are 
found, for example, in models of physical, electronic, economic, demographic 
and biological phenomena. 


First-order differential equations, which are the particular topic of this unit, 
feature derivatives of order one only; that is, if the rate of change of variable 
y with respect to variable x is involved, the equations feature dy/dx but not 
d?y/dx*, d°y/dzx°, etc. 


When a differential equation arises, it is usually an important aim to solve 
the equation. For an equation that features the derivative dy/dz, this entails 
expressing the dependent variable y directly in terms of the independent 
variable x. The solution process requires the effect of the derivative to be 
‘undone’. The reversal of differentiation is achieved by integration, so it is 
to be expected that integration will feature significantly in the methods of 
solution for differential equations. 


Integration can be attempted either symbolically, to obtain an exact formula 
for the integral, or numerically, to give approximate numerical values from 
which the integral can be tabulated or graphed. The same two approaches 
can therefore be tried to obtain solutions of differential equations, and both 
are introduced in this unit. 


Section 1 considers in detail one example of how a differential equation 
arises in a mathematical model. This is followed by some basic definitions 
and terminology associated with differential equations and their solutions. 
We also note how errors and accuracy are defined. 


Section 2 starts by looking at the direction field associated with a first-order 
differential equation. This is a device for visualizing the overall behaviour of 
the differential equation and of its solutions, and leads to a basic numerical 
method of solution known as Euler’s method. Both direction fields and 
Euler’s method are implemented in a computer subsection. 


Section 3 turns to analytic (that is, symbolic) methods of solution, consid- 
ering first direct integration and then separation of variables. 


Section 4 describes a further analytic approach to solving differential equa- 
tions, called the integrating factor method. It applies only to equations that 
are linear. Linear first-order differential equations are important in their 
own right, but also give valuable clues on how to solve linear second-order 
differential equations, which are the subject of Unit 3. 


In Section 5 you will see how each of the analytic methods from Sections 3 
and 4 can be implemented on your computer. 
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Section 1 Some basics 


1 Some basics 


Subsection 1.1 develops a mathematical model for a specific situation which 
leads naturally to a first-order differential equation. A key step in deriving 
this equation is to apply the input-output principle, which is a useful device 
for building relations between variables. 


Subsection 1.2 addresses what is meant by the term ‘solution’ in the context 
of first-order differential equations, and brings out the distinction between 
the general solution and the various possible particular solutions. The spec- 
ification of a constraint, or initial condition, usually permits us to find a 
unique function that is a particular solution of the differential equation and 
also satisfies the constraint. 


The short Subsection 1.3 provides the definition and description of numerical 
errors, in anticipation of Euler’s method in Section 2. 


1.1 Why differential equations? 


In the course you will meet many examples of differential equations. Fre- 
quently these arise from studying the motion of physical objects, but we 
shall start with an example drawn from biology and show how this leads 
naturally to a particular differential equation. 


Suppose that we are interested in the size of a particular population, and in 
how it varies over time. The first point to make is that any population size is 
measured in integers (whole numbers), so it is not clear how differentiation 
will be relevant. (Differentiable functions must be continuous, and therefore 
defined on an interval of real numbers in R.) Nevertheless, if the population 
is large, say in the hundreds of thousands, a change of one unit will be 
relatively very small, and in these circumstances we may choose to model 
the population size as a continuous function of time. We shall write this 
function as P(t), and our task is to show how P(t) may be described by a 
differential equation. 


Let us assume a fixed starting time (which we shall label t = 0). If the 
population is not constant, then there will be ‘leavers’ and ‘joiners’. For 
example, in a population of humans in a particular country, the former 
will be those who die or emigrate, whilst the latter represent births and 
immigrants. 


It is usual to express birth rates as a proportion of the current population 
size. For example, the UK Office for National Statistics quotes birth rates in 
various age groups as a number per 1000 women. Death rates are specified 
in asimilar way. To emphasize that these rates are expressed as a proportion 
of the current population, we shall use the terms ‘proportionate birth rate’ Note that in our model the 


and ‘proportionate death rate’. proportionate birth rate is 
; ; expressed as a proportion of 
For our simple model we shall ignore immigration and emigration, and con- the whole population, not 


centrate solely on births and deaths. Denote the proportionate birth rate just the number of women. 
by 6 and the proportionate death rate by c. Then, in a short interval of 
time dt, we would expect 


number of births ~ bP(t)dt, (1.1) 
number of deaths ~ cP(t)ot, (1:2) 


where P(t) is the population size at time t. 
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At this stage, we seek some relationship between the chosen variables. In 
order to find this, we make use of the input—output principle, which can 
be expressed as 


accumulation] = | input |—|output}. 


This principle applies to any quantity whose change, over a given time in- 
terval, is due solely to the specified input and output. 


The accumulation 6P of population over the time interval dt is the popu- 
lation at the end of the interval minus the population at the start of the 
interval; that is, 

6P = P(t + dt) — P(t). 


The input is the number of births (Equation (1.1)), and the output is the 
number of deaths (Equation (1.2)). The input-output principle now enables 
us to express the accumulation dP of the population over the time interval 
ot as 


OP ~ bP(t)ét — cP(t)dt = (b — c)P(t)dt. 
Dividing through by 6t, we obtain 

6P 

ot 
The approximations involved in deriving this equation become progressively 


more accurate for shorter time intervals. So, finally, by letting dt tend to 
zero, we obtain 


aP 
dt 
(This follows because 
dP _ P(t+ dt) — Pi) 
= = im 
dt dt—0 ot 
is the definition of the derivative of P.) 


(b—c)P(t). 


(b—c)P(t). 


This is a differential equation because it describes dP/dt rather than the 
eventual object of our interest (which is P itself). The purpose of this unit 
is to enable you to solve a wide variety of such equations. 


Of course, we can simplify the above equation slightly by using the pro- 
portionate growth rate r, which is the difference between the proportionate 
birth and death rates: r = b—c. Then our model becomes 

dP 

dt 
For very simple population models, r is taken to be a constant. As we shall 
see, this leads to a prediction of exponential growth (or, if r < 0, decay) 
in population size with time, as illustrated in Figure 1.1. This may be a 
very good approximation for certain populations, but it cannot be sustained 
indefinitely if r > 0. 


rTP. 


In practice, both the proportionate birth rate and the proportionate death 
rate will vary, and so therefore will the proportionate growth rate. It turns 
out to be convenient to model these changes as being dependent on the 
population size, so that the proportionate growth rate r becomes a function 
of P. The justification for this is as follows. When the population is low, one 
may assume that there is potential for it to grow (assuming a reasonable 
environment). The proportionate growth rate should therefore be high. 
However, as the population grows, there will be competition for resources. 
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This is the step that requires 


P to be a continuous (rather 
than discrete) function of t. 


r>0O 


Figure 1.1 


ov 
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Thus the proportionate growth rate will decline, and in this way unlimited 
(exponential) growth does not occur. 


A particularly useful model arises from taking r(P) to be a decreasing linear 
function of P. We shall write this as 


P You will see later why this 
r(P)=k (1 — =) ’ (1.3) particular form is chosen. 


where k and M are positive constants. Looking at this formula, you can 
see that the proportionate growth rate r decreases linearly with P, from the 
value k (when P = 0) to 0 (when P = M). 


Using this expression for r, the above differential equation satisfied by P 
becomes 


® pp(r2). om 


This is well known to biologists as the logistic equation — we shall consider 
it further in Section 2, and see how to solve it in Section 3. For now, we have 
achieved our objective of showing that differential equations arise naturally 
in modelling the real world. 


Exercise 1.1 


Suppose that a population obeys the logistic model (with the proportionate 
growth rate given by Equation (1.3)), and that you are given the following 
information. When P = 10 the proportionate growth rate is 1, and when 
P = 10000 the proportionate growth rate is 0. Find the corresponding 
values of k and M. 


1.2 Differential equations and solutions 


This subsection introduces some of the fundamental concepts associated 
with differential equations. First, however, you are asked to recall some 


terminology and notation from your previous exposure to calculus. Some of this terminology and 


beep , : . . notation was discussed in 
The derivative of a variable y with respect to another variable x is denoted — [jnjz 1. 


in Leibniz notation by dy/dzx. In this derivative expression we refer to y as 
the dependent variable and to x as the independent variable. 


There are other notations in use for derivatives. If the relation between 
variables x and y is expressed in terms of a function f, so that y = f(z), 
then the derivative may be written in function notation as f’(). 


A further notation, attributed to Newton, is restricted to cases in which 
the independent variable is time, denoted by t. The derivative of y = f(t) 
could be written in this case as y, in which the dot over the y stands for the 
d/dt of Leibniz notation. Thus we may express this derivative in any of the 
equivalent forms 


Further derivatives are obtained on differentiating this first derivative. The 
second derivative of y = f(t) could be represented by any of the forms 

d2 

aa = 9= SO. 
These possible notations have different strengths and weaknesses, and which 
is most appropriate in any situation depends on the purpose at hand. You 
will see all of these notations employed at various times during the course. 
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It is common practice in applied mathematics to reduce the proliferation of 
symbols as far as possible. One aspect of this practice is that we often avoid 
allocating separate symbols to variables and to associated functions. Thus, 
in place of the equation y = f(t) (where y and t denote variables, and f 
denotes the function that relates them), we could write y = y(t), which is 
read as ‘y is a function of t’. (You have seen examples of this in the previous 
subsection. ) 


The following definitions explain just what are meant by a differential equa- 
tion, by the order of such an equation, and by a solution of it. 


Definitions 


(a) A differential equation for y = y(x) is an equation that relates 
the independent variable x, the dependent variable y, and one or 
more derivatives of y. 

(b) The order of such a differential equation is the order of the high- 

est derivative that appears in the equation. Thus a first-order 

differential equation for y = y(x) features only the first derivative, 
dy/dzx. 


A solution of such a differential equation is a function y = y(z) 
that satisfies the equation. 


(c) 


These definitions have been framed in terms of an independent variable x 
and a dependent variable y. You should be able to translate them to apply 
to any other independent and dependent variables. Thus Equation (1.4) is 
a differential equation in which t is the independent variable and P is the 
dependent variable. It is a first-order equation, since dP/dt appears in it 
but higher derivatives such as d?P/dt? do not. By contrast, the differential 
equation 

2 
got +y’ sing = 2 
is of second order, since the second derivative d?y/dzx? appears in it but 
higher derivatives do not. 


The topic of this unit is first-order differential equations. Moreover, it con- 
centrates upon first-order equations that can be expressed (possibly after 
some algebraic manipulation) in the form 


The right-hand side here stands for an expression involving both, either or 
neither of the variables x and y, but no other variables and no derivatives. 


According to the definition above, a function has to satisfy a differential 
equation in order to be regarded as a solution of it. The differential equation 
is satisfied by the function provided that when the function is substituted 
into the equation, the left- and right-hand sides of the equation give an 
identical expression. 


You are asked to verify in the next exercise that several functions are solu- 
tions of corresponding first-order differential equations. Later in the unit, 
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Strictly speaking, this is an 
abuse of notation, since there 
is ambiguity as to exactly 
what the symbol y represents: 
it is a variable on the 
left-hand side of y = y(t) but 
a function on the right-hand 
side. However, it is a very 
convenient abuse. 


Second-order differential 
equations are the subject of 
Unit 8. 


Equation (1.4) is of this form, 
P 


with f(t,P) =kP (1 = =). 


This substitution includes the 
requirement that the function 
should be differentiable 

(i.e. that it should have a 
derivative) at all points where 
it is claimed to be a solution. 
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you will see how all of these differential equations may be solved; but even 
when a solution has been deduced, it is worth checking in the manner of this 
exercise (i.e. by substitution) that the supposed solution is indeed correct. 


Exercise 1.2 


Verify that each of the functions given below is a solution of the correspond- 


ing differential equation. 
d 
¥*(a) y = 2e” — (a? + 2x + 2); wu _ yta’. Remember that an asterisk 
dex denotes an exercise (or part 
(b) y= 422 +3; dy of one) that is considered 


2 2 dr as particularly important. 

*(c) w= 2e"/?2, wl = cu. 
27 — x? d x 
(dd) 7= (-3V3 <“u< 373); it rc (y# 0). Note that the restriction 
3 dx 3y y #0 placed on the 
*(e) y=ttet; gy=-yttt+l. differential equation in 
: : : art (d) is necessary to ensure 

*(f) y=t+Ce*; y=—-yt+tt+1. (Here C is an arbitrary constant.) ae sare ic well ae 


In the last two parts of Exercise 1.2 you were asked to verify that 
y=t+e* and y=t+Ce* 


are solutions of the differential equation y = —y+t-+1, where in the second 
case C' is an arbitrary constant. In saying that C is arbitrary, we mean that 
it can assume any real value. Whatever number is chosen for C’, the corre- 
sponding expression for y(t) is always a solution of the differential equation. 
The particular function y = t + e~¢ is just one example of such a solution, 
obtained by choosing C = 1. 


This demonstrates that solutions of a differential equation can exist in pro- 
fusion; as a result, we need terms to distinguish between the totality of all 
these solutions for a given equation and the individual solutions that are 
completely specified. 


Definitions 


(a) The general solution of a differential equation is the collection 
of all possible solutions of that equation. 


(b) A particular solution of a differential equation is a single solu- 
tion of the equation, and consists of a solution function whose rule 
contains no arbitrary constant. 


In many cases it is possible to describe the general solution of a first-order 
differential equation by a single formula involving an arbitrary constant. For 
example, y = t+ Ce is the general solution of the equation y = —y+t+1; 
this means that not only is y= t+ Ce‘ a solution whatever the value of C, 
but also every particular solution of the equation may be obtained by giving 
Ca suitable value. 
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*Exercise 1.3 
(a) Verify that, for any value of the constant C, the function y = C — ze 


is a solution of the differential equation 
dy =e". 
dx 


(b) Verify that, for any value of the constant C, the function u = Ce’ —t—1 
is a solution of the differential equation 


u=t+u. 
(c) Verify that, for any value of the constant C’,, the function 
f= CMe 
1+ Cekt 


is a solution of Equation (1.4) on page 65. 


As you have seen, there are many solutions of a differential equation. How- 
ever, a particular solution of the equation, representing a definite relation- 
ship between the variables involved, is often what is needed. This is achieved 
by using a further piece of information in addition to the differential equa- 
tion. Often the extra information takes the form of a pair of values for the 
independent and dependent variables. 


For example, in the case of a population model, it would be natural to 
specify the starting population, Po say, and to start measuring time from 
t = 0. We could then write 


P= Pp) whent=0,_ or, equivalently, P(0) = Pp. 


A requirement of this type is called an initial condition. 


Definitions 
(a) An initial condition associated with the differential equation 
dy 


specifies that the dependent variable y takes some value yo when 
the independent variable x takes some value xp. This is written 
either as 


y = yo when x = x9 
or as 
y(o) = Yo- 
The numbers x9 and yo are referred to as initial values. 


(b) The combination of a first-order differential equation and an initial 
condition is called an initial-value problem. 


The word ‘initial’ in these definitions arises from those (frequent) cases in 
which the independent variable represents time. In such cases, the differen- 
tial equation describes how the system being modelled behaves once started, 
while the initial condition specifies the configuration in which the system is 
started off. In fact, if the initial condition is y(ao) = yo, then we are often 
interested in solving the corresponding initial-value problem for x > x9. 
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If x represents time, then 
x > Xo is ‘the future’ after the 
system has been started off. 
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We usually require that an initial-value problem should have a unique solu- 
tion, since then the outcome is completely determined by how the system 
behaves and its configuration at the start. Almost all the differential equa- 
tions in this course do have unique solutions. 


Example 1.1 

Using the result given in Exercise 1.3(b), solve the initial-value problem 
dy 
= ai 
tty yO) 

Solution 


From Exercise 1.3(b), on replacing the variables t, u by x, y, respectively, 
the general solution of the differential equation here is 

y=Ce* —x-1. 
The initial condition says that y = 1 when x = 0, and on feeding these values 
into the general solution we find that 

1=Ce°-0-1=C-1. 


Hence C' = 2, and the particular solution of the differential equation that 
solves the initial-value problem is 


y=2e*—-a2-1. Hf 


Exercise 1.4 


The size of a population (measured in hundreds of thousands) is modelled 
by the logistic equation 


dP P 
= =P (1-37), P(0) =1, 


where k = 0.15 and M = 10. 


*(a) Use your answer to Exercise 1.3(c) to find a solution to this initial-value 
problem. 


(b) Can you predict the long-term behaviour of the population size from 
your answer? 


Finally in this subsection, note that one needs to keep an eye on the domain 
of the function defining the differential equation. ‘Gaps’ in the domain 
usually show up as some form of restriction on the nature of a solution 
curve. For example, consider the differential equation 

dy 1 


a (1.5) 


It turns out that there are two distinct families of solutions of this equation, Figure 1.2 

given by y=Inz+C (if x > 0) and y= In(—z) + C (if <0). These two  gince |x| = —2 if « <0, you 
families of solutions are illustrated in Figure 1.2. Notice that the right-hand can see that this agrees with 
side of Equation (1.5) is not defined at x = 0, and that there is no solution what we know from Unit 1, 

that crosses the y-axis. namely that 


This unit deals with numerical and analytic (symbolic) methods of solving / et da = In |a}. 
differential equations. However, before we can discuss numerical methods, 

we need to know something about the way that errors and accuracy are 

described: this is the topic of the next subsection. 
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1.3 Approximations in calculations 


We often find that we need to make a calculation based on numerical values 
that are not exact — for example, there may be limitations on the accuracy 
to which a measurement can be taken. Finding a numerical solution to a 
differential equation almost always involves inexact arithmetic: any calcula- 
tor or computer will have some limit on the accuracy (number of significant 
figures) to which a decimal can be stored, and rounding errors will occur. 


In fact, using a numerical method usually involves repeated calculations, so 
inaccuracies may build up and have a significant effect on the result. When 
using a numerical method, one wants (if possible) to know how accurate 
the result is. With this in mind, we recall a few basic ideas relating to 
approximation and accuracy. 


One simple form of calculation is the evaluation of a function. Suppose that 
we want the value of f(z) for some function f, and use a value of 7 which 
is rounded to three decimal places, that is, 3.142. Since this is the value of 
a to three decimal places, we know that 


|3.142 — 7| < 0.0005 = 5 x 10-4 


(since 3.1415 < m < 3.1425). This gives us some idea of how accurate 3.142 
is as an estimate of 7. In general, we refer to the difference 


approximate value — true value 


as the error in the approximate value, and to the modulus of this difference, 
that is, 


|approximate value — true valuel, 


as the absolute error in the approximate value. So if € is the absolute 
error in using 3.142 as an estimate of 7, then we know that 


€ < 0.0005. 


In this context, the quantity 0.0005 is referred to as an absolute error 
bound. 


If we take 7 to be 3.142, what is the consequent error in f(z)? This will 
depend on the function f. 


Exercise 1.5 


For each of the functions f given below, use your calculator to find f(3.142). 
Then estimate f(m) more accurately, using 7 = 3.141 5926 (to seven decimal 
places). Hence estimate the error in using f(3.142) as an approximation 


to f(z). 
(a) f(z)=23 ~~ (b) f(x) = el 


In Exercise 1.5(b), you saw that an absolute error of less than 0.0005 in 
the value of 7 results in an absolute error of the order of 1.8 x 10! in 
the calculated value of f(z), for f(x) = e!°”. For this function f, errors 
are severely magnified! However, the situation is not quite so bad as this 
statement might suggest. The calculated value of f(z) is not completely 
unreliable — it is accurate to two significant figures. The value of f(z) is 
itself very large (4.4 x 1013), so an error of 1.8 x 10" is not quite so serious 
as it sounds. We often want a measure of ‘error’ that takes into account the 
size of the error relative to the size of the number being calculated. 
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Notice that, by definition, the 
absolute error is always 
greater than or equal to zero, 
whereas the error can be 
positive or negative. 


You are welcome to use your 
computer rather than a 
calculator if you prefer. 


Section 2 Direction fields and Euler’s method 


We define the relative error as 


approximate value — true value 


true value 


So the relative error in f (3.142) as an estimate of f(z) here is (roughly) 


A relative error of this size corresponds to a value that is accurate to two 
significant figures, as obtained in Exercise 1.5(b). Relative errors provide 
a guide to the number of significant figures that can be relied on, while Usually, a relative error of 


absolute errors relate to the number of decimal places that are accurate. 0.5 x 10” corresponds to a 
value that is accurate to n 


significant figures. 


End-of-section Exercises 


Exercise 1.6 

(a) Verify that, for any value of the constant C,, the function 
y=aresing+C (-l<a<1) 

is a solution of the differential equation 

dy 1 
ao ee 

(b) Using the result of part (a), find the solution of the initial-value problem 
dy 1 1 


Exercise 1.7 
(a) Verify that, for any value of the constant C,, the function 
a=tan(t+C) (-$<t+C< 4) 
is a solution of the differential equation 
&=14+2”. 
(b) Using the result of part (a), find the solution of the initial-value problem 
t=lt+a’, x(f)=1. 


2 Direction fields and Euler’s method 


Subsection 2.1 shows that qualitative information about the solutions of a 
first-order differential equation may be gleaned directly from the equation 
itself, without undertaking any form of integration process. The main con- 
cept here is the direction field, sketches of which usually give a good idea of 
how the graphs of solutions behave. 


Direction fields can also be regarded as the starting point for a numeri- 
cal (that is, calculational rather than algebraic) method of solution called 
Euler’s method, which is described and applied in Subsection 2.2. 


In Subsection 2.3 you will see how both direction fields and Euler’s method 
can be implemented on your computer. 
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2.1 Direction fields 


We start this subsection by considering what can be deduced about solutions 
of the differential equation 


dy 
2s 2.1 
Tg Af Y) (2.1) 
from direct observation of this equation. 
In Section 1 we encountered the logistic equation 
dP P 
—=kP(1- — 2.2 
dt ( M ) 2) 


where & and M are positive constants. In certain circumstances this is a 
useful mathematical model of population changes, in which P(t) denotes 
the size of the population at time t. The right-hand side of this equation 
is equal to zero if either P =0 or P= M. Hence, since dP/dt = 0 in both 
cases, each of the constant functions P = 0 and P= M is a particular 
solution of Equation (2.2). Within the model, these solutions correspond to 
a complete absence of the population (P = 0), and an equilibrium population 
level (P = M) for which the proportionate birth and death rates are equal. 


Such spotting of constant functions that are particular solutions is useful 
on occasion but of limited applicability. In general, more useful information 
can be deduced from the observation that, for any given point (x,y) in the 
plane, the equation 


(2.1) 


describes the direction in which the graph of the particular solution through 
that point is heading (see Figure 2.1). This is because if y = y(x) is any 
solution of the differential equation, then dy/dx is the gradient or slope of 
the graph of that function. Equation (2.1) therefore tells us that f(x,y) 
represents the slope at (x,y) of the graph of the particular solution that 
passes through (z,y). If the slope f(x,y) at this point is positive, then the 
corresponding solution graph is increasing (rising) from left to right through 
the point (x, y); if the slope is negative, then the graph is decreasing (falling); 
and if the slope is zero, then the graph is horizontal at the point. 


When looking at f(x,y) in this light, it is referred to as a direction field, 
since it describes a direction (slope) for each point (x,y) where f(x,y) is 
defined. 


Definition 


A direction field associates a unique direction to each point within 
a specified region of the (x, y)-plane. The direction corresponding to 
the point (x,y) may be thought of as the slope of a short line segment 
through the point. 


In particular, the direction field for the differential equation 


d 
7 = f(x,y) 


associates the direction f(x,y) with the point (2, y). 


Direction fields can be visualized by constructing the short line segments 
referred to above for a finite set of points in an appropriate region of the 
plane, where typically the points are chosen to form a rectangular grid. 
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Here we have 


f(t,P) =kP (1 2 x) 


Figure 2.1 A graphical 
representation of the slope at 
the point (Zo, yo) 


For example, if 

f(x,y) =x+y, then the 
slope at the point (1,2) is 
f(1,2) =1+2=83, the slope 
at the point (2,—7) is 
f(2,-7) =2—7=—5, and 
the slope at the point (3, —3) 
is f(3,-3) =3-3=0. 


Section 2 Direction fields and Euler’s method 


An example is shown in Figure 2.2, which corresponds to the differential 
equation 

“ =@+y. (2.3) 
In this case the chosen region is the set of points (x, y) such that —2 <a <2 
and 0 < y < 2, and the rectangular grid consists of the points at intervals 
of 0.2 in both the x- and y-directions within this region. 


Here f(z,y) =a+y. 
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Figure 2.2 


From this diagram, we can gain a good qualitative impression of how the 
graphs of particular solutions of Equation (2.3) behave. The aim is mentally 
to sketch curves on the diagram in such a way that the tangents to the curves 
are always parallel to the local slopes of the direction field. For example, 
starting from the point (—1,0.5) (that is, taking the initial condition to be 
y(—1) = 0.5), we expect the solution graph initially to fall as we move to 
the right. The magnitude of the negative slope decreases, however, and 
eventually reaches zero, after which the slope becomes positive and then 
increases. On this basis, we could sketch the graph of the corresponding 
particular solution and obtain something like the curve shown in Figure 2.3. 
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Exercise 2.1 

(a) Part of the direction field for the logistic equation 
dP P a : F 
— = P({—-—_ This is Equation (2.2) with 
dt 1000 k =1 and M = 1000. 


is sketched in Figure 2.4. Using this diagram, sketch the solution curves 
that pass through the following points: 


(0,1500), (0,1000), (0,100), (0,0), (0, —100). 


From your results, describe the graphs of particular solutions of the 
differential equation. 
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1000-# —-—-—-—-—-—-—--—--—--—--—--—--—---—--—---- = 
ee a i i a i i i i i i a a i ee 
a a an an an an an an an an an an an aan an an an aa 
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Figure 2.4 


(b) What does your answer to part (a) tell you about the predicted be- 
haviour of a population whose size P(t) at time ¢ is modelled by this 
logistic equation? 


Drawing by hand precise grids of line segments to represent direction fields 
is not a good use of your time. However, it is a task that your computer 
can be programmed to perform, as you will see in Subsection 2.3. Before 
investigating this, you will see in Subsection 2.2 how the concept of direction 
fields helps in constructing approximate numerical solutions for first-order 
differential equations. 


2.2 Euler’s method 


In the previous subsection it was suggested that the graphs of particular 
solutions of a differential equation 

d 

7 = f(a, y) 
could be ‘mentally sketched’ on a diagram of the direction field given by 
f(x,y). This was to be done in such a way that the tangent to the solution 
curve is always ‘parallel to the local slope’ of the direction field. While this 
gives a good visual image of the connection between the direction field and 
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the graph of a solution, it is somewhat short on precision. We could not 
expect, by this approach, to predict with any accuracy the actual solution to 
an initial-value problem. That task is the subject of the current subsection, 
which shows how an initial-value problem 

dy 


= <7 f(z,y), y(o) = V0; (2.4) 


may be ‘solved’ by calculational means. However, the direction field diagram 
is still of use in explaining how this numerical method arises. 


Suppose that instead of trying to sketch a solution curve to fit the direction 
field, we move in a sequence of straight-line steps whose directions are gov- 
erned by the direction field. The aim is to produce a sequence of points that 
provide approximate values of the solution function y(«) for the initial-value 
problem at a sequence of x-values. The steps are constructed as follows. 


Corresponding to the given initial condition y(a9) = yo, there is a point Po 
in the (x, y)-plane with coordinates (29, yo), and this is our starting point. 
At Po, the direction field f(x,y) defines a particular slope, namely f (29, yo). 
We move off from Po along a straight line that has this slope, and continue 
until we have travelled a horizontal distance h to the right of Pp. The point 
that has now been reached is labelled P,, as in Figure 2.5. 


ya 
slope =f(o, Yo) meee 
i }y,- 
P 1— Yo 
(Xo. Yo) ieee | 

is h a 

_ 
0 Xp x, x 

Figure 2.5 


The idea is that the point P,, whose coordinates have been denoted by 

(21, Yi), provides an approximate value Y; of the solution function y(x) at The reason for using Yj here, 
x = x1. Now, unless the solution function follows a straight line as 2 moves rather than yj, will be 

from 29 to 21, Yj is unlikely to give the exact value of y(a 1). However, the explained shortly. 

hope is that, because we headed off from xp along the correct slope, as given 

by the direction field, Y; will be reasonably close to the exact value. Before 

worrying about accuracy, let us continue with the construction of the points 

in our sequence. 


The next thing that we need to do, before proceeding to the second step 
in the construction process, is determine formulae for x; and Y; in terms 
of x9, yo, h and f(xo, yo). By the construction described, as the point P; 
is reached from Po by taking a step to the right of horizontal length h, we 
have 

ty = 29 +h. (2.5) 
We can express Y; in terms of other quantities by equating two expressions 
for the slope of the line segment PoP, 

Yi — yo 

h 

and then rearranging to give 


Yi = yo + hf (Zo, yo). (2.6) 


This completes the first step, and we now take a second step to the right. 


_ f (xo, Yo); 
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The direction of the second step is along the line with slope defined by the 
direction field at the point P,, namely f(x;, Y,). The second step moves us 
from P, through a further horizontal distance h to the right, to the point 
labelled P, as illustrated in Figure 2.6. This point provides an approximate 
value Y2 of the solution function y(x) at x = x2. 


yA 


slope = f(x, Y,) Py (X, Yo) 


ay 


Figure 2.6 


As in the first step, we need now to express the coordinates (x2, Y2) of P2 
in terms of 21, Yi, h and f(a, Yi). We have 


LQ= Ly + h (2.7) 
and (equating two expressions for the slope of the line segment P; P2) 
Y2-Y 
“> = fle1,¥i), 
which can be rearranged to give 
Yy=Y,+ hf (a1, Y1). (2.8) 


Having carried out two steps of the process, it is possible to see that the 
same procedure can be applied to construct any number of further steps, 
and we next generalize to a description of what happens at the (i+ 1)th 
step, where 7 represents any non-negative integer. 


Suppose that after 7 steps we have reached the point P;, with coordinates 
(a;,Y;). For the (i+ 1)th step, we move away from P; along the line with 
slope f(x;, Y;), as defined by the direction field at P;. After moving through a 
horizontal distance h to the right, we reach the point P;,1, whose coordinates 
are denoted by (2j41, Yi41), as illustrated in Figure 2.7. The point Pi; 
provides an approximate value Y;+, of the solution function y(x) at x = 2414. 


yA 
Pig (Xia Visi) 
slope =f(x;, Y; 
Viet 7 Y; 
Z | h 
nee 
- 

T > 
0 xj Xi+t m 
Figure 2.7 
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Arguing as before, we have 
Tip = Ujth (2.9) 


and (equating two expressions for the slope of the line segment P;P;1) 


Yin - ¥i 
re = Fi as, Yi) 


which can be rearranged to give 
Yi =) + ay a Py). (2.10) 


Note that Equations (2.5) and (2.7) are the special cases of Equation (2.9) 
for i = 0 and i = 1, respectively, and that Equation (2.8) is the special case 
of Equation (2.10) for i = 1. If we also define Yo to be equal to the initial 
value yo, then Equation (2.6) is the special case of Equation (2.10) for i = 0. 


To sum up, for the initial-value problem (2.4), we have a procedure for 
constructing a sequence of points 


P, with coordinates (%;,Y;) (¢=1,2,...), 


where the values of x; and Y; for each value of 7 are determined by the re- 
spective formulae (2.9) and (2.10). The starting point for the sequence is the 
point Po with coordinates (xo, Yo), where Yo = yo. Because the procedure 
is based on the direction field, each Y; provides an approximation at 7 = 2; 
to the value of the solution function y(x) for the initial-value problem. The 
horizontal distance h by which we move to the right at each stage of the 
procedure is called the step size or step length. 


Figure 2.8 shows the constructed sequence of points, and for comparison in- 

cludes a curve representing the graph of the exact solution of the initial-value 

problem (2.4). This makes clear that the successive points P;, P2, P3,... are 

only approximations to points on the solution curve. In fact, the situation 

shown in Figure 2.8 is typical of the behaviour of the constructed approx- 

imations, in that they gradually move further and further from the exact 

solution curve. This is because, at each step, the direction of movement is 

along the slope of the direction field at P; = (a;, Y;) and not along the slope 

of the direction field at (x;,y;), where y; = y(a;) denotes the value of the The common use of 
exact solution function at x = x;; that is, for each 2;, the slope for the next yi = y(xi) to represent the 
step is defined by a point close to the solution curve rather than by the point ¢*act solution at @ = a; 


explains why we use a 
exactly on that curve. different notation, namely Yj, 


for the numerical 
approximation to y(2;). 


Figure 2.8 
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Nevertheless, the formulae (2.9) and (2.10) provide a method for finding ap- 
proximate solutions to the initial-value problem (2.4), in terms of numerical 
estimates Y), Yo, Y3,... at the respective domain values 71, 72,73,.... This 
is called Euler’s method, and is summarized below. 


Procedure 2.1 _Euler’s method 
To apply Euler’s method to the initial-value problem 


Y= F(2,4), y(00) = wo (2.4) 


proceed as follows. 


(a) Take xp and Yo = yo as starting values, choose a step size h, and 
set 2 = 0. 


(b) Calculate the x-coordinate x;+1, using the formula 
Lind = M+ h. (2.9) 


(c) Calculate a corresponding approximation Y;,, to y(#;+41), using 
the formula 


Yo =F, hy (ead) (2.10) 


(d) If further approximate values are required, increase i by 1 and 
return to Step (b). 


Example 2.1 

Consider the initial-value problem 
dy 
= =a 
mn tty y(0) 


Use Euler’s method, with step size h = 0.2, to obtain an approximation to 
y(1). 
Solution 


We have xp = 0, Yo = yo = 1, and f(a;, Y;) = 2; + Y;. The step size is given 
as h = 0.2. Equation (2.9) with i = 0 gives 


%=r% +th=0+4+0.2 = 0.2, 
and Equation (2.10) with i = 0 gives 
¥, =Yo +hf(xo, Yo) =1+0.2x (0+ 1) =12. 
For the second step, we have (from Equation (2.9) with i = 1) 
v2 = 27, +h=0.24+ 0.2 = 0.4, 
and (from Equation (2.10) with i = 1) 
¥2 =¥, thf (#1, Vi) =1.2+0.2 x (0.2+1.2) =1.48. 


If more than a couple of steps of such a calculation have to be computed by 
hand, then it is a good idea to lay out the calculation as a table. In this 
case, by continuing as above and putting 7 in turn equal to 2, 3 and 4, we 
obtain Table 2.1. 
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The accuracy of such 
approximate solutions, and 
ways of improving accuracy, 
will be considered shortly. 


Leonhard Euler (1707-1783) 
was one of the most prolific 
mathematicians of all time. 

(His surname is pronounced 
‘oiler’.) He first devised this 
method in order to compute 
the orbit of the moon. 
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Table 2.1 

t ty Yj flee¥i) =a +, Yaa = Ye+ bf (ei. ¥i) 
0 O 1 1 1.2 

1 0.2 1.2 1.4 1.48 

2 0.4 1.48 1.88 1.856 

3 0.6 1.856 2.456 2.347 2 

4 0.8 2.3472 3.1472 2.976 64 

5 1.0 2.97664 


So, at « = 1, Euler’s method with step size h = 0.2 gives the approximation 
y(1) ~ 2.97664. 


*Exercise 2.2 


Use Euler’s method, with step size h = 0.2, to obtain an approximation to 
y(1) for the initial-value problem 


dy _ 


aa = 1: 


The solution of the initial-value problem given in Example 2.1 is in fact 
known exactly, and is y = 2e* — x — 1. Putting x = 1, this gives 


y(1) = 2e-—1-—1=3.436 56, 


correct to five decimal places. This value may be compared with the ap- 
proximation 2.97664 for y(1) obtained by Euler’s method in Example 2.1, 
and the comparison indicates that the approximation is not at all accurate. 
Indeed, the absolute error in this case is 


|2.976 64 — 3.436 56| = 0.459 92, 


which is about 13% of the exact value, and indeed not even one decimal 
place accuracy is achieved. 


Similarly, the other values Y; (¢ = 1,2,3,4) found in Example 2.1 contain 
significant absolute errors when considered as approximations to the cor- 
responding exact values y; = y(x;). This is illustrated in general terms in 
Figure 2.8, where the absolute error in approximation Y; is the vertical dis- 
tance from the point P; to the point directly above it on the exact solution 
curve. As shown there, and for reasons given earlier, the absolute error tends 
to increase as more and more steps are taken. 


The realization that Euler’s method can produce values that are poor ap- 
proximations to the exact solution of an initial-value problem invites us to 
ask whether the accuracy of the approximations can be improved, using this 
method. In fact, it is not hard to see that improvements in accuracy ought 
to be achieved by reducing the step size h. Our earlier prescription for con- 
structing the sequence of points P,, Po, P3,... from the starting point Pp 
and the given direction field amounts loosely to saying ‘match the direction 
of the solution curve at the current point, take a step, then adjust direction 
so as to try not to move further away from the curve’. It seems natural, 
therefore, that the approximations will improve if we reduce the size of the 
steps taken and correspondingly ‘adjust direction’ more frequently. This is 
illustrated for a hypothetical case in Figure 2.9. 


After each value of Y;41 has 
been calculated from the 
formula and entered in the 
last column, it is transferred 
to the Y; column in the next 
row. 


This exact solution was found 
in Example 1.1 (page 69). 
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Figure 2.9 


In fact, it can be shown that the accuracy of Euler’s method does indeed 
usually improve when we take a smaller step size. 


To demonstrate this, consider the initial-value problem from Exercise 2.2. 
This has the exact solution y = e” (as you can verify), and its value at 
x =1 is y(1) = e = 2.718 282, to six decimal places. In Exercise 2.2 you 
showed that with a step size h = 0.2, Euler’s method gives the approximation 
2.488 32 for y(1). Table 2.2 shows the corresponding results (to six decimal 
places) obtained when we apply Euler’s method to this same initial-value 
problem but with successively smaller step sizes h. 


Table 2.2 
h Approximation Absolute Number of 
to y(1) error steps 
0.1 2.593 742 0.124 539 10 
0.01 2.704 814 0.013 468 100 
0.001 2.716 924 0.001 358 1000 
0.0001 2.718 146 0.000 136 10000 


As expected, the absolute errors in the third column of the table become 
progressively smaller as h is reduced. 


Looking more carefully at these absolute errors, we notice that they seem 
to tend towards a sequence in which each number is a tenth of the previous 
one. Since each value of A in the table is a tenth of the previous one, this 
suggests that: 


absolute error is approximately proportional to step size h. 


This turns out to be a general property of Euler’s method, for sufficiently 
small values of the step size. So, not only do we know that accuracy can be 
improved by decreasing the step size h, but this general property also tells us 
that, by making h small enough, the absolute error in an approximation can 
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In Exercise 2.2, where 
h = 0.2, the value of y(1) was 
approximated by Y5. From 
the column for ‘number of 
steps’ in Table 2.2, you can 
see that y(1) is approximated 
by: 

Yio when h = 0.1; 

Yi00 when h = 0.01; 

Yi000 when h = 0.001; 

Y 10000 when h = 0.0001. 


You will see this property 
stated formally in Unit 26, 
where you will also see how 
the property can be used to 
estimate the size of absolute 
errors. 


Section 2 Direction fields and Euler’s method 


be made as small as desired. In other words, the absolute error approaches 
the limit zero as h approaches zero (as you might have expected from the 
intuitive argument preceding Figure 2.9). 


*Exercise 2.3 


Suppose that when Euler’s method is applied to the problem in Exercise 2.2, 
the absolute error in approximating y(1) is proportional to the step size h, 
for sufficiently small h. 


Use the last row of Table 2.2 to estimate the constant of proportionality, k 
say, and hence to estimate the step size required to compute y(1) correct to 
five decimal places (that is, so that the absolute error is less than 5 x 10~°). 


A few words of caution are necessary at this point. Although the absolute 
error can be made as small as we please by making the step size h sufficiently 
small, this is valid only if the arithmetic is performed using sufficient decimal 
places. Where a calculator or computer is involved, the number of decimal 
places that can be used is limited, and as a result rounding errors may 
be introduced into the calculations. After a certain point, any increase in 
accuracy brought about by reducing the size of h may be swamped by these 
rounding errors. 


Moreover, rounding errors are not the only problem. Before concluding that 

h should always be chosen to be very small, we must also consider the cost 

of this additional accuracy. Now, by cost is meant the effort involved, which 

can be measured in a variety of ways; commonly for iterative methods (such 

as Euler’s method) it is measured by the number of steps taken. In general 

for numerical methods, the greater the accuracy required, the greater the 

cost. To illustrate this, look back at Table 2.2. The last column of the 

table shows how the number of steps required for the calculation goes up 

in inverse proportion to the step size: to move from x =0 to x =1, it In general, to move from a 
takes 10 steps with step size h = 0.1, 100 steps with step size h = 0.01, to b (where 6 > a) with step 
and so on. Since, for sufficiently small h, the error in Euler’s method is Size / takes (b— a)/h steps. 
approximately proportional to the step size, it follows that for this method 

a ten-fold improvement in accuracy is paid for by a ten-fold increase in the 

number of steps required. 


So, for Euler’s method and similar methods, the choice of step size has to be 

based on a compromise between the two opposing requirements of accuracy 

and cost. Methods for choosing the step size are discussed in Unit 26, which 

also introduces other numerical methods for solving initial-value problems 

that are considerably more efficient than Euler’s method. In fact, Euler’s Greater efficiency means that 
method is not suitable for high-accuracy work. Its virtue lies rather in its the same or better numerical 
simplicity and its clear illustration of the basic principles of how differential &¢curacy is achieved with 


: . fewer numerical 
equations may be solved numerically. computations 


In any practical case, calculations of the type described in this subsection 
are ideally suited to being performed on a computer, as you will see in the 
next subsection. 
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2.3 Finding numerical solutions on the computer 


In this subsection you will see how the computer can be used to construct 
direction fields and to implement Euler’s method. 


Use your computer to complete the following activities. Pc | 


*Activity 2.1 
Plot the direction field for the differential equation 


dy _ a 
—“=r+y This is Equation (2.3) 
dx (page 73). 

in the region 
-222052, VA9S 2 


On the basis of the plot of the direction field, what can you say about the 
graphs of solutions of the differential equation? 


*Activity 2.2 


Use Euler’s method to obtain approximations to y(1) for the initial-value 


problem 

dy 

= n= 

dg tty yO) =0, 
using step sizes h = 1, 0.5, 0.2, 0.1, 0.01, 0.001, 0.0001, in turn. In each case, 
plot the graph of the solution on an appropriate direction field diagram, and 
observe how each graph compares with the previous one. 


*Activity 2.3 


Euler’s method is to be used to estimate the value of the function y(x) at 

x =0.1,0.2,...,1 for the initial-value problem 
“ =at+y, y(0)=0. 

(a) Use the step sizes h = 0.1, 0.01, 0.001, 0.0001, in turn. Compare the 
results in each case with the exact solution y = e* — x — 1, and comment 
on how the size of the absolute error varies with h. 


(b) Compare your estimates for the step sizes h = 0.1 and h = 0.01. Then 
compare your estimates for all four step sizes. What can you conclude? 


End-of-section Exercise 


Exercise 2.4 
(a) Without plotting the direction field, say what you can about the slopes 
defined by the differential equation 


(b) Verify that your conclusions are consistent with the direction field dia- 
gram in Figure 2.10. 
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Figure 2.10 


(c) On the basis of the direction field, what can be said about the graphs 
of solutions of the differential equation? 


(d) Write down the formulae required in order to apply Euler’s method to 
the initial-value problem 
dy 2 
== —1) = -0.2 
dx a x ? y( ) ? 
using a step size h = 0.1. 


3 Finding analytic solutions 


This section and the next look at methods for finding analytic solutions 
of first-order differential equations — that is, solutions expressed in terms 
of exact formulae. It is not always possible to find analytic solutions, and 
in these cases numerical methods of approximate solution, such as Euler’s 
method, are applied. Even when a formula for the solution is obtainable, 
it may be so complicated that a numerical solution is preferred. However, 
where a simple formula can be found, this is likely to be more informative 
than use of a numerical method. 


This section specializes from the form of differential equation 


considered earlier. Subsection 3.1 looks at cases in which f(z, y) is taken to 
be a function of x alone, f(x). You will see that the differential equa- 
tion can then be solved by direct integration, assuming that the neces- 
sary integration can be performed. Subsection 3.2 considers cases in which 
f(x,y) = g(x)h(y) (the product of a function of x and a function of y). 
These can be solved in principle by the method of separation of variables. 
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3.1 Direct integration 


An example of a differential equation that can be solved is 
d 
Y= 2? 
dx 
In order to do this, we need to find functions y(x) whose derivatives are x 
one such function is y = $a, There are other functions with this same 
derivative, for example y = 50° +1 and y= 50° — 2. In fact, any function 
of the form 


(3.1) 


2. 
’ 


y= ar +C, 
where C is an arbitrary constant, satisfies the differential equation (3.1). 
This is an expression for the general solution. 


The expression $x? + C is also the indefinite integral of x7: that is, 
[ea 52 +C. 


In this case, therefore, the indefinite integral of x? is the general solution of 
Equation (3.1), and a similar connection applies more generally. 


Consider the differential equation 


d 


y_ 
a 


where the right-hand side, f(x), is a function of x alone. Suppose that we 
have a particular solution y = F(x) of this differential equation; in other 
words, F(x) is an integral of f(x). In such circumstances, the general so- 
lution of Equation (3.2) is given by y = F(a) + C, where C is an arbitrary 
constant; and the indefinite integral of f(x) is given by the same expression, 


[t@ dx = F(x) +C. 


This means that the general solution of Equation (3.2) can be written down 
directly as an indefinite integral; and, if the integration can be performed, 
then the equation is solved. 


(3.2) 


Procedure 3.1 Direct integration 


The general solution of the differential equation 


(3.2) 


y= f f@)de=F@)+C, 


where F(x) is an integral of f(a) and C is an arbitrary constant. 


Once the general solution has been found, it is possible to single out a 
particular solution by specifying a value for the constant C’. As before, this 
value may be found by applying an initial condition. 


Example 3.1 

(a) Find the general solution of the differential equation 
dy _ 4-32 
aa = 6 : 
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The values C = 0, C = 1 and 
C = —2 give the three 
particular solutions 
mentioned above. 


This is hardly surprising, 
since integration ‘undoes’ or 
reverses the effect of 
differentiation. 


The function f(x) is assumed 
to be continuous (i.e. its 
graph has no breaks). 


Section 3 Finding analytic solutions 


(b) Find the particular solution of this differential equation that satisfies 


the initial condition y(0) = 3. 


Solution 


(a) On applying direct integration, we obtain the general solution 
y= fee dx = —ze ** +C, 


where C is an arbitrary constant. 


(b) In order to satisfy the initial condition y(0) = 3 (that is, y= 3 when 
x = 0), we must have 


pad t0, 
so C' = 2. The required particular solution is therefore 


y= —je* +2. Hf 


Procedure 3.1 uses x for the independent variable and y for the dependent 
variable. As usual, you should be prepared to translate this into situations 
where other symbols are used for the variables. But remember that the 
method of direct integration applies solely to first-order differential equations 
for which the derivative is equal to a function of the independent variable 
alone. Thus direct integration can be applied, for example, to the differential 
equation 


to give the general solution 
L= [costa =sint+C, 


where C is an arbitrary constant. (Here t is the independent variable and x 
is the dependent variable.) On the other hand, the differential equation 


dx 2 

dt 
cannot be solved by direct integration, since the right-hand side here is a 
function of the dependent variable, x. 


Exercise 3.1 


Solve each of the following initial-value problems. 


(a) “ 62, ars 


d 
(b) 7 =e, (0) =2. 
U 
ome er that y st f 
#(c) = 5sin2t, 4(0) —0. Remember that y stands for 


dy/dt, where t denotes time. 


The method of direct integration succeeds in solving a differential equation 
of the specified type whenever it is possible to carry out the integration that 
arises, and this task may require you to apply any of the standard techniques 
of integration, such as integration by parts and integration by substitution. Integration by parts and 


For more difficult integrals, a computer algebra package can be used. integration by substitution 
are revised in Unit 1. Many 
integrations can be performed 
by reference to the table of 
standard integrals in the 
course Handbook. 
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Exercise 3.2 


Find the general solution of each of the following differential equations. 


dy = —22 
(a) dg = 2° 
t 
*(b) p= ive (Hint: For the integral, try the substitution u = 1+ ??.) 


The answer to Exercise 3.2(b) can be generalized to any differential equation 
of the form 


dy _, f(a) 
dx f(z) 
where k& is a constant, to give the general solution 


y= kIn|f(x)|+C, 


where C is an arbitrary constant. 


(f(x) #0), 


3.2 Separation of variables 


Direct integration applies, in an immediate sense, only to the very simplest 
type of differential equation, as described by Equation (3.2). However, all 
other analytic methods of solution for first-order equations eventually also 
boil down to performing integrations. In this subsection, we consider how 
to solve first-order differential equations of the form 


dy 
where the right-hand side f(x,y) is the product of a function of x and a 
function of y; that is, equations of the form 


oH = g(a)h(y). (3.3) 


One example of this type of differential equation is 
“ =az(1+y’). (3.4) 
We divide both sides of this equation by 1+ y?, to obtain 
1 dy 
l+y2 dx 


and then integrate both sides with respect to x, which gives 


1 dy 


Applying the rule for integration by substitution (in Leibniz notation) to 
the left-hand side, we obtain 


1 dy 1 
a a nr, 
l= a l= a 


so Equation (3.5) becomes 


1 


On performing the two integrations, we obtain 


PE 


arctan y = 5a" +C, (3.6) 
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This is a simple extension of 
the result from Unit 1 that 
[ Fea-my@i+e 
f(z) 
for f(a) £0. 


Here we have g(a”) = x and 
h(y) =1+y?. 


Note that 1 + y? is never zero, 
so it is safe to divide by it. 


See Section 6 of Unit 1. 


See the table of standard 
integrals in the Handbook. 


Section 3 Finding analytic solutions 


where C is an arbitrary constant. Making y the subject of the equation, we Note that one arbitrary 
obtain the solution expression constant suffices. 


y = tan($2?+C). 
The approach just demonstrated applies more widely. In principle, it works 
for any differential equation of the form 

dy 

= h(y). . 

oF = g(a)h(y) (3.3) 
On dividing this equation through by h(y) (for all values of y other than 
those where h(y) = 0), we obtain 

1 dy 

Integration with respect to x on both sides gives 


lig cer [owe 


and, on applying the rule for integration by substitution to the left-hand 
side, this becomes 


/ oe dy = [oo dx. (3.7) This is the form that you 
hA(y) need to remember! Note that 
If the two integrals can be evaluated at this stage, then we reach an equation you can obtain it ‘informally’ 
that relates 2 and y and features an arbitrary constant. This equation is by dividing Equation (3.3) by 
the general solution of the differential equation (for values of y other than h(y), smultiplying eeoel 
those where h(y) = 0); but usually y will not be the subject of this equation pe edn 

: * two integral signs. 
It is a form of the general solution called an implicit (general) solution of 
the differential equation. (An example of an implicit solution is provided 
by Equation (3.6).) Usually, the final aim is to make y the subject of the 
equation, if possible — that is, to manipulate the equation into the form 


y = function of «x. 


This is called the explicit (general) solution of the differential equation. 


In either case (implicit or explicit), a particular solution may be obtained 
from the general solution as before, by applying an initial condition. 


The method just described for solving differential equations of the form (3.3) 
is called the method of separation of variables since, in Equation (3.7), we 
have separated the variables to either side of the equation, with only the 
dependent variable appearing on the left and only the independent variable 
on the right. The method is summarized below. 


Procedure 3.2. Separation of variables 


This method applies to separable differential equations, which are of 
the form 


oY — g(a)h(y). (3.3) 


(a) Divide both sides by h(y) (where h(y) 4 0), and integrate both 
sides with respect to x, to obtain 


/ an / gender. (3.7) 


(b) If possible, perform the integrations, to obtain an implicit form of 


the general solution. It is a good idea to check, by 
substitution into the original 
(c) If possible, rearrange the formula found in Step (b) to give y in differential equation, ee the 


terms of x. This is the explicit (general) solution. function obtained is indeed a 
solution. 
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The separation of variables method is useful, but there are some difficulties 
with it. First, it may not be possible to perform the necessary integrations. 
Second, the general solution obtained is restricted to those values of y such 
that h(y) #0. Third, it may not be possible to perform the necessary 
manipulations to obtain an explicit solution. 


Of these difficulties, the first can be overcome by use of a numerical method, 
such as Euler’s method. The second will be discussed shortly. The third 
will usually also need numerical techniques. 


It is necessary to be careful about the domain or image set of the solution 
obtained, as the following example illustrates. 


Example 3.2 

(a) Find the general solution of the differential equation 
dy x 
OO 0 
re 3y (y > 0) 


(b) Find the particular solution that satisfies the initial condition y(0) = 3. 


Solution 


(a) The equation is of the form 


where the obvious choices for g and h are 
g(z)=—x and h(y) = 1/(3y). Notice that since y > 0, 


h(y) i : 
We now apply Procedure 3.2. On dividing through by h(y) = 1/(3y) ae 
(that is, multiplying through by 3y) and integrating with respect to z, 
the differential equation becomes 


[ov dy = / —x dx. With practice, you will be 
able to move directly to this 
Evaluating the integrals gives stage, as shown in 


3.9 1.2 Procedure 3.2. 
where B is an arbitrary constant. This is an implicit form of the general 
solution. 


On solving for y (and noting the condition y > 0 given above, which 
determines the sign of the square root), we obtain the explicit general 
solution 


y= $(2B — 27), 


This can be simplified slightly by writing C in place of 2B, where C 
is another arbitrary constant. However, we need to recognize that the 
formula for y represents a real quantity greater than zero only when the 
argument of the square root is positive, so we must have C — 2? > 0. 
This in turn means that C cannot be completely arbitrary, since it must 


at least be positive. The general solution in this case is therefore Since 2? > 0 for all 2, 
C — x? > 0 implies that 
y = 4/3(C — 2?) (=-VC <2 <vVC), C > x* >0, so C must be 
positive. 


where C’ is a positive but otherwise arbitrary constant. 
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(b) The initial condition is y(0) = 3, so we substitute x = 0 and y = 3 into 


the general solution above. This gives 3 = aC, so C' = 27, and the 


required particular solution is 
You verified in Exercise 1.2(d) 


y=4/ 3(27 —x2) (-3/3<a<3V3). that this function is a solution 


of the differential equation. 


*Exercise 3.3 


A mass m(t) of a uranium isotope, which is present in an object at time ft, 
declines over time due to radioactive decay. Its behaviour is modelled by 
the differential equation 


dm ; . 

—-=-)\m (m>0), This model can be applied to 

dt other radioactive substances 
where the decay constant X is a positive constant characteristic of the ura- by selecting the appropriate 
nium isotope. value of the parameter X. 


(a) Find the general solution of this differential equation. 


(b) Find the particular solution for which the initial amount of uranium 
present (at time t = 0) is mo. 


The condition m > 0 in Exercise 3.3 arose from the modelling context. This 
condition enabled us to find the general solution without needing to worry 
about dividing by zero at Step (a) of the separation of variables method 
(and hence without needing to restrict the image set further). Suppose we 
were to forget the modelling context — that is, suppose we were to remove 
the restriction m > 0. How does this affect the solution process? And how 
do we cope with the case where m = 0? These questions are answered in the 
following example where, to emphasize the absence of the previous modelling 
context, the variables used are x and y. 


Example 3.3 


Find the general solution of the differential equation 


where X is a non-zero constant. 


Solution 

To apply the separation of variables method, we need to exclude the cases 
where y= 0. So, for y #0, on dividing through by y, integrating with 
respect to x, and using the rule for integration by substitution on the left- 
hand side, we obtain 


[iw = [or di. (3.8) 


Integrating, we obtain You saw in Unit 1 that 


1 
In|y| =—Ax + B, [punt WA. 
where B is an arbitrary constant. Taking exponentials gives 


ly| = e Att B 


or, removing the modulus sign, 
Le AttB = 


+eFe** = Ce”, 


y= 


where C' = +e? is a non-zero but otherwise arbitrary constant. 
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This is not quite the general solution, as we have to consider what happens 
when y = 0. Now, looking at the above solution, it is natural to ask what 
happens when C'=0. This gives the zero function, y = 0 for all x, and 
inspection of the differential equation shows that this is a particular solution. 
So we now have the general solution 


y= Ce. 


where C is an arbitrary constant. (Positive C corresponds to y > 0, negative 
C to y < 0, and C = 0 to the particular solution y= 0.) 


The above example illustrates that: 


e the separation of variables method requires that h(y) #4 0 and gives a 
family of solutions containing an arbitrary constant; 


e the case when h(y) = 0 is exceptional and can give extra solutions that 
may or may not have the same form as the family of general solutions. 


The following exercises provide you with some practice at applying the sepa- 
ration of variables method and at completing the general solution for values 
of y such that h(y) = 0. 


Exercise 3.4 


Find the general solution of each of the following differential equations. 


dy  y-1 dy 2y 
ee = (x > 0) ) ee 


*Exercise 3.5 
Solve the initial-value problem 


dv 
eutu 


— = 0) =0. 
oF , v(0) 


End-of-section Exercises 


Exercise 3.6 


Find the general solution of each of the following differential equations, 
where a is a non-zero constant. 


(a) H= > (ua) 
(b) M=-_+* _ @ 40, 41/2) 


dx x(1—az) 
Exercise 3.7 
Find the general solution of each of the following differential equations. 
(a) w=au = (b) ¢=14+2? 
Exercise 3.8 
(a) Solve the initial-value problem 
dP P 
Be ot Se P(0) = Py (where Pi 
a RP (1 7) ; (0) » (where Po > 0), 
where k and M are positive constants. 


(Hint: For the integral involving P, the solution for Exercise 3.6(b) 
should be of use.) 


(b) Describe what happens to the solution P(t) as t becomes large. 
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The differential equation here 
is the logistic equation 
(Equation (2.2)) which, as 
was pointed out earlier, may 
be used as a model for the 
size P(t) of a population at 
time t. The direction field of 
this equation in a specific case 
was examined in Exercise 2.1 


(page 74). 


Section 4 Solving linear differential equations 


4 Solving linear differential equations 


This section presents one final method of analytic solution for first-order 
differential equations. The details of this integrating factor method appear 
in Subsection 4.2. It applies only to a particular form of equation known 
as a linear differential equation. The definition and some properties of this 
type of equation are introduced in Subsection 4.1. 


4.1 Linear differential equations 


This subsection introduces the concept of linearity as applied to differen- 
tial equations. Here the concept is introduced in the context of first-order 
differential equations, but you should be aware that the idea generalizes to 


higher-order differential equations and is important from a theoretical point Linear second-order 
of view. differential equations are 
considered in Unit 3. 


Definitions 


(a) A first-order differential equation for y = y(x) is linear if it can be 
expressed in the form 


dy 


— + 9(x)y = h(x), (4.1) This differential equation can 
dar be written in the general form 
where g(x) and h(x) are given functions. dy faa) 
yen: ZY 
(b) A linear first-order differential equation is said to be homoge- da . 
F A that we have been using by 
neous if h(x) =0 for all z, and inhomogeneous or non- puitbine 
homogeneous otherwise. fae gain hia), 


For example, the differential equation 


dy 


2 3 
S-=$£y=2 
dx # 
is linear, with g(x) = —x? and h(x) = x, whereas the equation 
dy 2 
= 7 
dx ¢ 


is not, due to the presence of the non-linear term y?. 


*Exercise 4.1 


Decide whether or not each of the following first-order differential equations 


is linear. 
dy BS dy _ ‘ dz _ 1/2 
(a) ae {=z (b) qe sine (c) (Ta 3z 


d 
(2) g+yP=t (oe) ety ay (f) (14a) + bey = 32” 


An important theorem guarantees that an initial-value problem based on a 
linear first-order differential equation has a unique solution. 
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Theorem 4.1 


If the functions g(x) and A(x) are continuous throughout an interval 
(a,b) and xo belongs to this interval, then the initial-value problem 


—_ +9(z)y=h(x), y(xo) = yo, 


has a unique solution throughout the interval. 


This is a very powerful result, since it means that once you have found a 
solution in a particular interval, that solution will be the only one. 


There is a particularly useful technique for solving linear differential equa- 
tions, to which we turn next. 


4.2 The integrating factor method 


As you have seen, the method of separation of variables relies upon an 
application of the rule for integration by substitution, which is equivalent to 
the Composite Rule (or Chain Rule) for derivatives. It is natural to enquire 
whether there might similarly be a method for solving first-order differential 
equations that derives from the rule for integration by parts or, equivalently, 
from the Product Rule for derivatives. There is indeed such a method, and 
it is the subject of this subsection. 


To introduce the topic, consider the differential equation 
d 
(1+ 2?) 7 + 2ry = 327. (4.2) 
x 


Note first that 2a (the coefficient of y) is the derivative of 1 + a? (the co- 
efficient of dy/dx). It follows from the Product Rule that 


d d 
= (1+ 2?)y) = (2 +0?) + 2ay. 


The right-hand side of this equation is the same as the left-hand side of 
Equation (4.2), so we can rewrite the latter as 


< ((1+27)y) = 322. (4.3) 


Now the left-hand side here is just the derivative of (1+ 2x)y, so we can 
apply direct integration to Equation (4.3) to obtain 


(t+2%)y= f stde= 2° +¢, 


where C is an arbitrary constant. Division by 1 + 2? then gives the general 
solution of Equation (4.2) explicitly, as 

r+C 

1+22- 
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This includes the possibility 
that either a = —oo or b= ow, 
so the interval might be all of 
the real line. 


As you saw in Exercise 4.1(f), 
this differential equation is 
linear; but it is not soluble by 
direct integration or by 
separation of variables. 


Section 4 Solving linear differential equations 


This solution was arrived at by noting that the left-hand side of Equa- 
tion (4.2) is of the form 


gu ae 
4 4.4 
P ae dx” ie 
where p = 14+ 2, and that this form can be re-expressed, using the Product 
Rule, as 
d 
ag (PY): 


Linear differential equations need not come in this convenient form. For 
example, the left-hand side of the equation 


2 
f+ (5 7 y-SS (4.5) 
dx 


is not of the form (4.4). However, Equation (4.2) can be obtained from 
Equation (4.5) on multiplying through by p= 1+ 2”. For this reason, 
p=1+2? may be called an integrating factor for Equation (4.5): it is 
the factor by which Equation (4.5) needs to be multiplied in order that the 
resulting differential equation has a left-hand side of the form (4.4), enabling 
direct integration to be performed. 


This leaves the question of how such an integrating factor can be found, 
starting from Equation (4.5). The answer comes from writing down the two 
properties that such a function p = p(x) must satisfy, as follows. 


e Multiplying Equation (4.5) by p gives, on the left-hand side, 


dy 22 
P ae P 1+ 2? - 


e The left-hand side must be of the form 


p—+—y. (4.4) 


Comparison of these two expressions shows that p must itself be a particular 
solution of the differential equation 


dp 2x 


This is a homogeneous linear first-order differential equation, and we can 
solve it by separation of variables. Indeed, following Procedure 3.2, the 
equation becomes (for p 4 0) 


Fae ies 
ie? 


Performing the left-hand integral gives 


2x 
el i ie 


|p| = exp (/ ae i) (4.7) 


SO 
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Now, performing the integral on the right, 

= exp(In(|1 + 2?|) + A) 

= exp(A)|1 + 2?| 

= D(1+2”"), 

where D (= exp(A)) is a positive but otherwise arbitrary constant. Hence 
p=+D(1+2°), 

which, by redefining D, can be written as 
p= D(1+2”), 


where D is now a non-zero but otherwise arbitrary constant. 


|p 


Thus an integrating factor for Equation (4.5) is p(x) = D(1 +27). Multi- 
plying through the equation by this factor yields 


d 
D(i1i+ 2) + 2Day = 3Dz’, 


and now you can see that (since D 4 0) the arbitrary constant D can be 
chosen without affecting the applicability of the form (4.4). Therefore we 
choose the integrating factor to have the simplest possible form — in this 
case we obtain p(x) = 1+ 2”. 


As you have seen, this leads to the solution of Equation (4.5) by direct inte- 
gration, and the formula for this integrating factor is given by Equation (4.7) 


as 
2x 


This approach generalizes to any linear first-order differential equation, pro- 
vided that the integrals involved can be evaluated. For an equation written 
in the form 

dy 

— = h(x), 4.1 

o + o(x)y = A(x) (4.1) 


the function g(x) takes the place of 22/(1 +27) in Equation (4.5). To find 
an integrating factor p = p(«) for Equation (4.1), the argument proceeds as 
above, with 22/(1+ x) replaced by g(x) at each step. This leads to the 
generalized form of Equation (4.8), namely 


phe ( 7 as ir) (4.9) 


which defines the integrating factor for Equation (4.1). 


When Equation (4.1) is multiplied through by the integrating factor, the 
resulting differential equation is 


ro) SE + plaala)y = pla)h(a), (4.10) 


the left-hand side of which, by the definition of p, is of the form (4.4); so 
Equation (4.10) can be re-expressed, using the Product Rule, as 


—(p(«)y) = p(a) h(a). (4.11) 


Direct integration can then be used on Equation (4.11) to try to find the 
general solution. 


This integrating factor method is summarized below. 
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Note that 1+ x? > 0, so 
jl+a?7|=1+4+ 27. 


The case D = 0 corresponds 
to the solution p = 0 of 
Equation (4.6), but this 
solution is not of interest. 


Remember that calculation of 
the integrating factor does 
not require the inclusion of a 
constant of integration. 


The definition of p ensures 
that the left-hand side of 
Equation (4.10) is of the 
form (4.4) since 


Section 4 Solving linear differential equations 


Procedure 4.1 Integrating factor method 


This method applies to differential equations of the form 


d 
a + 9la)y = (2). (4.1) 
ie 
(a) Determine the integrating factor 
p = exp (fate) ar) ‘ (4.9) 


(b) Multiply Equation (4.1) by p(a) to recast the differential equation 
as 


r(x) + pla)ale)y = pla)h(a). 


(c) Rewrite the differential equation as 


 (p(2)y) = vle)h(z). 


(d) Integrate this last equation, to obtain 


p(x)y = [oon dx. 


(e) Divide through by p(a), to obtain the general solution in explicit 
form. 


As with the separation of variables method, it may not be possible to perform 
the necessary final integration. However, in the remainder of this subsection 
we give examples and exercises for which this method can be used. 


Example 4.1 


Use the integrating factor method to find the general solution of each of the 
following differential equations. 


dy Qxry dy y-1 dy 2y 
(a) de" 241 ) dx e eed (c) dx 1+2? 
Solution 
(a) On rearranging the differential equation as 
dy 2xry 
ae =f, 
dx x?+1 
we see that it is in the form of Equation (4.1) with 
2 
g(x) = Zul and A(z) =a. 


The integrating factor (from Equation (4.9)) is therefore 


— Jeune 
p= exp gl 


= exp(In |x? + 1]) 
= exp(In(2? + 1)) 
=a? +1. 


(since 1+ 2? > 0) 


The constant of integration is 
not needed here. 


You can, if you wish, check 
that you have found p 
correctly by checking that 


r(x) SE + plogla)y 


= = way), 


i.e. by checking that 
dp/dx = p(x)g(2). 


It is a good idea to check, by 
substitution into the original 
equation, that the function 

obtained is indeed a solution. 


The first example cannot be 
solved by separation of 
variables. The latter two can, 
as you saw in Exercise 3.4. 
You can compare these 
answers with those obtained 
earlier. 


Checking, we see that 


oP = oe = g(2)p(2). 
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Multiplying both sides of the differential equation by this factor yields 
d 
(2? + j= + 2Qey = a(x? + 1), 
dx 
and the differential equation thus becomes 


< ((a7 + 1)y) =2(x? + 1). 


Integrating both sides gives 
(a? + ly = [eo +1) dx 


= / (a? + x) dx 
= qu" i 5a" +, 
where C is an arbitrary constant. Finally, to obtain an explicit solution 
we divide by x? + 1 to obtain 
gt + 227+ 4C 
A(x? + 1) 


y= 


(b) On rearranging the differential equation as 
dy 1 1 


ie ge 
we see that it is in the form of Equation (4.1) with g(x) = h(x) = —-1/za. 
The integrating factor (from Equation (4.9)) is therefore 


raon(f(2) 4) 


=exp(—Inz) (since z > 0) Recall that alnx = In(x*) 
1 and hence, in particular, 
= exp (1m (=)) —Ing = In(x~*) = In(1/z). 
1 
=-_, Checking, we see that 
x 
dp 1 
Multiplying through the equation by p(x) = 1/x gives as g(x)p(2). 
ldy 1 1 
edz a’ ge’ 


and the differential equation becomes 


df1\ 1 
7G is ae 


Integration then gives 


eof Ca) 


=—4C, 
x 


where C is an arbitrary constant. The general solution is therefore 
y=14+Crz, 


where C is an arbitrary constant. 
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(c) In order to put the given differential equation into the form (4.1), we 
need to bring the term in y to the left-hand side to obtain 


ay = 0. (4.12) 


Hence, in this case, we have g(x) = —2/(1 +7) and h(x) =0. The 
integrating factor is 


2 A 
——— (/ (-) ir) = exp(—2arctan.x) = eW2 eter, 


Multiplying through by the integrating factor gives 


e2 arctan x dy _ 2y 
1+ 2? 


—2arctanz __ 0 


da 
Thus the differential equation can be rewritten as 


eo? ony) =). 


7 
It follows, on integrating, that 


—2arctanz,, _ = 2arctan x 
Yy —— Cs ¥y al Ce ’ 


where C is an arbitrary constant. This is the general solution. lH 


e€ or, equivalently, 


*Exercise 4.2 


Find the general solution of each of the following differential equations. 


(a) @ 


dy _ 
dn (bh) —-=9+e 


—y=e" sing 
y dx 
Exercise 4.3 


Use the integrating factor method to solve each of the following initial-value 
problems. 


a(0) = 2. 
y(1) = 1. 


(a) a =e, 
(b) ty + 2y =, 


End-of-section Exercises 


Exercise 4.4 
Which method would you use to try to solve each of the following linear 
first-order differential equations? 


dy 30.8 dy _ : 
(a) ap v==z (b) qe sine 


XL 


du 


dy 
—+5v= 2) 27 2 
(c) 5u = 0 (d) (1+ a*)—+2ry=1l+a 


Exercise 4.5 — 
Solve each of the following initial-value problems. 
(a) gyty=t+1, y(1)=0. 


by egetaety, y0)=3, 


The equation is homogeneous. 


Checking, we see that 


dp 7 —Qe~Zarctan x 
dx 1+? 
2 
— p,-2arctanz [ _ 
a ( ay 
= p(x)g(a). 


For part (b), see 

Examples 1.1 and 2.1. 

A direction field diagram is 
shown in Figure 2.2. 


You saw the differential 
equation in part (a) in 
Exercise 3.7(a), where you 
solved it using separation of 
variables. 


The differential equation in 
part (a) is equivalent to that 
considered in parts (e) and (f) 
of Exercise 1.2. 
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Exercise 4.6 


Find the general solution of each of the following differential equations. 


d 
(a) oe _3y=2 a> 0) 
dx 
dv 
—+4y= 2t 
(b) ee u= 308 


(Hint: If a and 6 are non-zero constants, then 


at 


at _ € : 
Je cos bt dt = Gay pe (cos bt + bsin bi) +C, 


where C is an arbitrary constant.) 


5 Finding analytic solutions on the 
computer 


In this section you will see how direct integration, the method of separation 
of variables and the integrating factor method can be used on the computer 
to solve first-order differential equations. 


Use your computer to complete the following activities. 


*Activity 5.1 

Use direct integration to solve the initial-value problem 
dy —3 5 
dz AE, a y(0) 7 3° 


Compare your solution with that obtained in Example 3.1. 


*Activity 5.2 


Use separation of variables to find the general solution of each of the follow- 
ing differential equations. 


dy dy 2y dy 2 
Oe a =] 
@) a, YO) a CO 

Compare your solutions with those obtained in Example 3.3 and Exer- 
cises 3.4(b) and 3.7(b), respectively. 


*Activity 5.3 


Use the integrating factor method to solve the following initial-value prob- 
lems. 


dy _ _ 
(a) qe ty y(0) =0. 


d 
(b) co +2y=a%, y(=L 


Compare your solutions with those obtained in Exercises 4.2(b) and 4.3(b), 
respectively. 
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*Activity 5.4 


Use the integrating factor method to find the general solution of each of the 
following differential equations. 


(a) 


ot —ay=s (b) 4 dy = 8008 2x 


Compare your solutions with those obtained in Exercises 4.6(a) and 4.6(b), 
respectively. 


Outcomes 


After studying this unit you should be able to: 


understand and use the basic terminology relating to differential equa- 
tions and their solutions; 

check by substitution whether a given function is a solution of a given 
first-order differential equation or initial-value problem; 

find from the general solution of a first-order differential equation the 
particular solution that satisfies a given initial condition; 

appreciate the difficulties with domains and image sets for the solution 
of some differential equations; 

deduce the qualitative behaviour of solutions from consideration of a 
first-order differential equation itself, as visualized from its direction 
field; 

set up the formulae required by Euler’s method for solving an initial- 
value problem, carry out a few steps of the method by hand, and use 
the computer to deal with large numbers of steps; 

recognize when a first-order differential equation is soluble by direct 
integration, and carry out that integration when appropriate, by hand 
in simple cases and otherwise on the computer; 

recognize when a first-order differential equation is separable, and ap- 
ply the method of separation of variables by hand in simple cases and 
otherwise on the computer; 

recognize when a first-order differential equation is linear, and solve such 
an equation by the integrating factor method, by hand in simple cases 
and otherwise on the computer. 


Outcomes 
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Solutions to the exercises 


Section 1 


P 
1.1 We have r(P) =k{1—- if 


to solve the following pair of simultaneous equations: 


e(1— 10000 = 
M 


From the second equation, since k > 0, we see immedi- 
ately that M = 10000. Substituting in the first equa- 


tion leads to 


fo =1, so k= cy 
1000 999 


), so we simply need 


1.2 In each case, differences in notation notwithstand- 


ing, the differential equation has the form 
dy 


dz — f(x,y), 


and we need to show that the given function y = y(x) 
satisfies this equation, i.e. gives the same expression for 
either side of the equation. 
(a) If y = 2e” — (x? + 22+ 2), then differentiating y 
gives 

dy 


de 72° 7 At 2; 


and substituting the expression for y into the expression 
for f gives 


f(x,y) =y+2? = 2c" — (2? +2r4+2)4+2° 
= 2e* — 2x4 —2, 

as required. 
(b) If y= $a? + 3, then 

d 
7 =a and f(x,y) =7. 
(c) If u=2e*’/?, then 

/ du x? /2 


u er ee 


and 
f(a,u) = 2u= Qe” /?, 
(d) If y= /Q7—2?)/3 (-3V3 < x < 33), then 
dy (27-a?\? 
dx 8 3 
and 
x x (27-22\ 1? 
(e) Ify=t+e‘, then 


fy) =—ytt+1=—-(t+e*)+t4+1=1-e7%. 
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(f) Ify=t+Ce~‘, then 


_ dy -t 
= Ce 
and 
fy) =—-y+t4+1=-(+Ce) +441 


=1-Ce™. 


1.3 In each case, differences in notation notwithstand- 
ing, the differential equation has the form 


d 
ie = fle), 


and we need to show that the given function y = y(x) 
satisfies this equation, i.e. gives the same expression for 
either side of the equation. 
(a) fy=C— je~**, then 

d 

°F _ e-8® and fixyjy=e"**. 

da 
(b) If u=Ce' —t—1, then 

du 

4 = — =Ce'-1 

t= e€ 
and 

f(t,u) =t+u=Ce -1. 
(c) If 


then, using the Quotient Rule for differentiation, 
dP (1+ Ce*) (CMke**) — (CMe**) (Cke**) 
dt (1+ Ce**t)2 


_ CMe*t 1+ Ce* — Cet 
7 1+ Cekt 1+ Cekt 


= CMekt i Cekt 
7 1+ Cekt 14+ Cekt 


1.4 (a) From Exercise 1.3(c) we know that 
_ CMe Ace 
~ 14 Cek% 14 Ceo-15t 
is a solution of the differential equation. The initial 
condition P(0) = 1 then implies (since e° = 1) 
10C 


P(t) 


1 = —— C=}1, 
ico 2 
A particular solution is therefore 
ae 10 60.15¢ _ 10¢-15¢ | 
t+ ger lt Q + ¢0.15t 
(b) Dividing top and bottom by e°!°*, we see that 
10 
—— 
Qe—0.15t +1 


For large values of t, the exponential term on the bot- 
tom will be very small. The result is that P will ap- 
proach the value 10 in the long term. 


1.5 Calculator results are given to eight significant fig- 
ures. (Different calculators may give slightly different 
results. ) 


(a) f(3.142) = 31.018 339, 
f (8.141 592 6) = 31.006 275. 
So we have 
error = f(3.142) — f(z) 
~ f (3.142) — f(3.1415926) 
~ 0.012. 


(b) f(3.142) = 4,421 1982 « 10", 
f (3.141 5926) = 4.403 1482 x 107°. 
So we have 
error = f(3.142) — f(z) 
~ f (3.142) — f(3.141 5926) 
#1810", 


1.6 (a) If y=arcsinz +C (—1 < 2x < 1), then differ- 
entiating gives 
dy 1 


dx V1 — 2?’ 
so y satisfies the given differential equation. 
(b) The initial condition is y() = 3, 
L= 5. On substituting these values into the solution 
from part (a), we have 

Z =arcsing +C = ed, 
This gives C = 3, 
problem is 


i.e. y= 5 when 


so the solution of the initial-value 
y=arcsing+ 3 (-l<2<\1). 


1.7 (a) Ife =tan(¢+ C) (-% <t+C < $), then dif 
ferentiating gives 


. a 
t= ai = sec?(t+ C) =1+ tan?(¢+C) =1+2’, 
so x satisfies the given differential equation. 


(b) The initial condition is 2(4) = 1, ie. e =1 when 
t= 4. On substituting these values into the solution 
from part (a), we have 


ic tan(4 + C) F 
This gives C = arctan 1 — 7 = 0, so the solution of the 
initial-value problem is 


x= tant (-% <t<§). 


Section 2 


2.1 (a) The slope is shown to be zero at all points on 
the horizontal lines P = 0 and P = 1000, so these corre- 
spond to constant solutions of the differential equation. 
(As pointed out earlier in the text, these two solutions 
can also be spotted directly from the form of the differ- 
ential equation.) 


Solutions to the exercises 


The graphs of solutions through a starting point above 
the line P = 1000 appear to decrease, but at a slower 
and slower rate, tending from above towards the limit 
P = 1000 as ¢ increases. 


The graphs of solutions through starting points in the 
region 0 < P < 1000 are increasing, with slope grow- 
ing before the level P = 500 is reached and declining 
thereafter. For large values of t, these graphs tend from 
below towards the limit P = 1000. 


For a starting point in the region P < 0, the graphs de- 
crease without limit and with steeper and steeper slope. 
These various cases are illustrated by typical graphs in 
the figure below. 


Pa 
1500 \ : 4 iY 4 4 
\ \ \ \ \ \ \ 
\ \ \ \ \ \ \ 
\ \ \ \ \ \ 
1000 ——* NN NN NN 
ia - - - = - - - - - 
~ a 7 7 a a 7 7 Z 7 
4 4 7 4 7 7 e Aa 7 ee 
500% 4% 74 4 4 4 4 4a 
v 7 7 4 7 7 7 7 7 7 
ia 7 a a a a a a a7 
— - - - - - - - - - 
0 rT - r 7 * 
N N N 4s \N 6s N gs \ * 
\ \ \ \ \ \ \ \ \ 
\ \ \ \ \ \ \ \ \ 
500 A \ \ \ \ \ \ \ \ \ 


(b) If the differential equation is considered as a model 
of population behaviour, then the region P < 0 must be 
excluded. The analysis above leads to the following pre- 
dictions for the population. 


e Ifthe population is zero at the start, then it remains 
Zero. 


e Ifthe population size starts at 1000, then it remains 
fixed at this level. 


e Ifthe population starts at a level higher than 1000, 
then it declines (more and more gradually) towards 
1000. 


e If the population starts at a level below 1000 (but 
above 0), then it increases and eventually tends 
gradually towards 1000. 


2.2 For the initial-value problem 

Yay, y(0)=1, 
we have 2 = 0, Yo = yo = 1 and f(a;,Y;) = Y;. The 
step size is given as h = 0.2. Equation (2.9) with 7 = 0 
gives 

a, =a th=04+0.2=02, 
and Equation (2.10) with 7 = 0 gives 

Y= + hfe H) = 1202 kia 


101 


Unit 2 First-order differential equations 


Applying Equations (2.9) and (2.10) in turn for 
i = 1,2,3,4, we obtain the following table. 


a 24 Y; flenYJ=% Yau =%+hf(en%) 
00 1 1 1.2 

1 0.2 1.2 1.2 1.44 

2 0.4 1.44 1.44 1.728 

3 0.6 1.728 1.728 2.0736 

4 0.8 2.0736 2.0736 2.488 32 

5 1.0 2.488 32 


The approximation to y(1) is 2.488 32. 


2.3 Since we are told that, for sufficiently small h, the 
absolute error is proportional to the step size h, we can 
deduce from the last row of Table 2.2 that there exists 
a constant k such that 


0.000 136 = 0.0001k, 


so k = 1.36. In order to determine y(1) correct to five 
decimal places, h must be such that 

1.36h <5 x 10° 
or 

5 x 10-6 
h< ——_— ~37x 10°. 
< 736 ‘ 

So a suitable choice of h would be 10~® = 0.000 001. 


(In fact, using this value of h gives an approximation to 
y(1) of 2.718 280, which is correct to 5 decimal places.) 


2.4 (a) The slope defined by the direction field 
f(a,y) = y+? is zero when y = —x?, which is a 
parabola in the lower half-plane with vertex at the 
origin. Below this parabola we have y < —x? and 
f(x,y) <0, while above the parabola we have y > —2? 
and f(x,y) > 0. Thus all slopes for points of the plane 
below the parabola y = —2? are negative, and all slopes 
for points above it are positive. 

Also, if x is fixed, then f(x,y) = y+ 2? is an increasing 
function as y increases. If instead y is fixed, then for 
x > 0, f(x,y) increases as x increases, and for x < 0, 
f(x,y) increases as « becomes more negative. These 
observations indicate that the slope given by the di- 
rection field increases as we move from bottom to top 
along any vertical line, whereas on moving along any 
horizontal line, the slope increases with distance from 
the y-axis. 


(b) The features described in the solution to part (a) 
are all apparent on the direction field diagram. This 
direction field diagram is repeated below, with the 
parabola y = —x? superimposed upon it. (Note that 
this parabola does not represent a solution of the dif- 
ferential equation.) 
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(c) It appears from the direction field that there are 
several types of solution. Any solution whose graph 
cuts the y-axis above the origin has positive slope at all 
points. The solution graph that passes through the ori- 
gin has zero slope there, but positive slope everywhere 
else. Any solution graph that cuts the y-axis below 
the origin has a maximum (where it meets y = —a? for 
x <0). Some of these graphs also have a minimum 
(where they meet y = —x? for z > 0). Others have no 
minimum (though this is not clear from the diagram 
given). A solution graph of each type is sketched be- 
low. 
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(d) The initial-value problem is 


dy 2 
de 
From Equations (2.9) and (2.10), the necessary formu- 


lae are 


y(—1) = —0.2. 


C1 = ath, 


Yi4d = x ar hf (a;, Yj). 


For the current problem, 29 = —1, Yo = yo = —0.2, 
f (xi, Yi) = Yi + 2? and h =0.1. The particular formu- 
lae needed here are therefore: 
t41 = 2;,+0.1, where rp = —1; 
Yin =¥i+0.1(¥; +27), where Yo = —0.2. 
The second of these formulae can also be written as 
Yin1 = 1.1¥;+0.12?, where Yp = —0.2. 


Section 3 


3.1 We apply direct integration to find the general so- 
lution. In each case, C is an arbitrary constant. 


(a) The differential equation dy/dx = 6x has general 
solution 


y= [ 6rde = 30 +¢. 


From the initial condition y(1) = 5, we have 5=3+C, 
so C =2. The solution of the initial-value problem is 
therefore 

y = 327 + 2. 


(b) The differential equation du/du = e*™ has general 
solution 


v= fe du= se +C. 


From the initial condition v(0) = 2, we have 2= ++C, 
soC= t The solution of the initial-value problem is 
therefore 


v= ze™ +7, 
(c) The differential equation y = 5sin2t has general 
solution 


y= [ssinzeat = —3 cos 2t+C. 


From the initial condition y(0) = 0, we have 
0= -3 +C,soC= 3. The solution of the initial-value 
problem is therefore 


y = 3(1 —cos2t). 


3.2 (a) The differential equation dy/dxz = re~?” has 
general solution 


y= | vo dx. 


The integral may be found using integration by parts. 
Taking f(x) = x and g'(xz) = e~?*, and using the for- 
mula 


/ F(a)g' (a) de = f(e)g(a) - / f'(a)9(a) de, 


poor dx = —tae** + , te 7 dx 


1,.,—2 1,-2 
‘7 Ze a t C,; 


where C is an arbitrary constant. The general solution 
of the differential equation is therefore 


y= —4(2e + 1)e-** +C. 


Solutions to the exercises 


(b) The differential equation p = t/(1 +t?) has general 
solution 


t 
= | —~ dt. 
’ lm 


Using the hint provided, we make the substitution 
u=1+#, for which du/dt = 2t. This gives 


t 1 
t= 2t) dt 
lbest ae Me 


1 
_l 
=iInu+C (sinceu=1+t? >0) 
= ;ln(1+t)+C, 
where C is an arbitrary constant. The general solution 
of the differential equation is therefore 


p=$sln(l+?t?)+C. 


3.3 (a) The differential equation is dm/dt = —\m, 
where m > 0. Following Procedure 3.2, we obtain 


[gama [ayae 


and, since m > 0, integration produces 

Inm = —At+ B, 
where B is an arbitrary constant. On solving this equa- 
tion for m, by taking the exponential of both sides, we 
obtain 

m = eOt+B — eBe-™ — Ce-™, 
where C = e? is a positive (since e? > 0 for all B), but 
otherwise arbitrary, constant. The general solution is 
therefore 


m=Ce™, 


where C is a positive but otherwise arbitrary constant. 


(b) The initial condition is m(0) = mo, from which we 
have mo = Ce®, so C = mo. The required particular 
solution is therefore 


m= moe ™. 


3.4 (a) The differential equation is 
dy y-1l 
dx 

In order to apply the separation of variables method, we 

need to exclude the cases where y= 1. So, for y ¥ 1, 

on applying Procedure 3.2 we have 


1 
foo- [pe 
y-—1 x 


Since x > 0, for y £1 (so that y— 1 4 0), integration 
produces 

Injy —1|=Ina+B, 
where B is an arbitrary constant. On solving this equa- 
tion for y, by first taking the exponential of both sides, 
we obtain 

y=1lt elntt+B 74 eBelme 14 Ca, 
where C = +e? is a non-zero but otherwise arbitrary 
constant. 


, where x > 0. 
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Examination of the differential equation shows that 
C=0 also gives a solution (the constant function 
y = 1). The general solution is therefore 

y=1+Cz, 
where C is an arbitrary constant. 
(If you cannot convince yourself that this is the general 
solution for all y, including y = 1, then you will see it 
proved in Example 4.1.) 
(b) The differential equation is dy/dx = 2y/(a? + 1). 
In order to apply the separation of variables method, 


we need to exclude the cases where y = 0. So, for y 4 0, 
on applying Procedure 3.2 we have 


1 2 


Since y £ 0, integration produces 
In |y| = 2(arctan a + B), 


where B is an arbitrary constant. On solving this equa- 
tion for y, we obtain 


2arctan 2+2B = L928 p2 arctan © = Ce? arctan o 


= re 


where C = +e?” is a non-zero but otherwise arbi- 
trary constant. Examination of the differential equa- 
tion shows that C' = 0 also gives a solution (the zero 
function y = 0). The general solution is therefore 

y= Ce? Se 


where C is an arbitrary constant. 


(If you cannot convince yourself that this is the general 
solution for all y, including y = 0, then you will see it 
proved in Example 4.1.) 


3.5 The differential equation is du/du = e“*” = e“e”. 
Dividing through by e” and integrating with respect 
to u, we obtain 


ie dv = fe du. 


Integration produces 
-e "=e"+B, 
where B is an arbitrary constant. On solving this equa- 
tion for v, we obtain 
v = —In(-e* — B) = —In(C - e”), 
where C = —B. We need C' > 0 and u < InC in order 


for the argument of In here to be positive. Hence the 
general solution is 


v=-In(C-—e") (u<lnQC), 
where C is a positive but otherwise arbitrary constant. 
The initial condition v(0) = 0 gives 0 = —In(C — e°), 
so C—e® = 1 and hence C = 2. The solution of the 
initial-value problem is therefore 


v=—In(2-—e") (u<In2). 


3.6 Each of the differential equations is soluble by di- 
rect integration. 


(a) The general solution of dy/du = 1/(u— a), where 
u # a, is given by 


1 
v= | du. 
u-a 
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Integration produces the general solution 
y=Inju—al+C, 
where C is an arbitrary constant. 


(b) The general solution of dy/dx = 1/(a#(1 — az)), 
where « 4 0, x £ 1/a, is given by 


= flaw 


This can be solved by using the substitution u = 1/2. 
Alternatively, if the integral is rearranged as 


1 1 
= eae se on 


then we can use the table of standard integrals in the 
Handbook. Either method of integration gives the gen- 
eral solution 


y=C-ln 


—-al, 
ax 


where C is an arbitrary constant. 


3.7 Each of the differential equations can be solved by 
separation of variables. 


(a) The differential equation is u’ = du/dx = xu. For 
the cases where u # 0, we divide through by wu and in- 
tegrate with respect to x. This gives 


[gaua [ vac. 
u 


Integration produces 


In |u| = $2? + B, 
where B is an arbitrary constant. On solving this equa- 
tion for u, we obtain 


2 2 2 
u = te? /2+B = + eB et /? = Cer /?, 


where C = te? is a non-zero but otherwise arbitrary 


constant. However, the case C' = 0 can be added, since 
it can be seen by inspection of the differential equation 
that the zero function u = 0 is a solution. Hence the 
general solution is 


2 
u=Ce® /?, 


where C is an arbitrary constant. 


(You verified that wu = Qe%"/2 isa particular solution of 
this differential equation in Exercise 1.2(c).) 

(b) The differential equation is ¢ = dxr/dt = 1+ 2°. 
We divide through by 1+ «x? and integrate with respect 
to t. This gives 


1 
ae ee cr 
lame / 


Integration produces 

arctanz =t+C, 
where C is an arbitrary constant. On solving for x, we 
have 

x =tan(t+C). 
A suitable domain for the solution is —} <t+C < §, 
since the image set of arctan is the interval (-&, a) 


Thus the general solution is 
g=tan(t{+C) (-3<t+C<%), 


where C is an arbitrary constant. 


(You verified that this is a solution of the given differ- 
ential equation in Exercise 1.7.) 


3.8 (a) The given equation is dP/dt = kP(1— P/M). 
First, note that (as remarked upon in Section 2) the 
constant functions P = 0 and P = M are both solu- 
tions. Assuming that we are considering neither of these 
possibilities (we are certainly not interested in P = 0 
since we know that Pp > 0), we can use the separation 
of variables method to obtain 


of parce = fie 


The integral on the left-hand side is of the form evalu- 
ated in Exercise 3.6(b), with 1/M in place of a. Hence 
we have 
1 7 1 1 
k |P M 
where B is an arbitrary constant. On solving for P, we 
find first that 


=i Bb 


i 


FL ect) 
PM ~ 
= +e-kB e—ht 
=Ce™ 
where C = te~*# is a non-zero but otherwise arbitrary 


constant. However, note that C = 0 corresponds to the 
constant solution P = M already noted, so the restric- 
tion C # 0 may be dropped. Hence we obtain 


-1 
P= € + ce) (e** 4 MC), 
where C is an arbitrary constant. 
From the initial condition P(0) = Pp, we have 
n= (Gree) so G22. 
M Po M 
The solution of the initial-value problem is therefore 


r= (4+(h-d)en) 
M Pp M , 
which yields 
P= as : 
1+ (M/Po —1)e—*# 
Finally, we rewrite this in the more familiar form 
7 Me** 
~ ekt + (M/Py —1)' 


kt 


(b) Ast — 00 we have e~** — 0, and consequently the 


value of P(t) approaches M. 

(Note that this is true whether the starting value Po is 
greater than or less than M. This result is consistent 
with the specific direction field shown in Figure 2.4.) 


Solutions to the exercises 


Section 4 


4.1 (a) The equation dy/dz + x°y = 2° is linear, with 
glay=e° and A(z) =2". 

(b) The equation dy/dx = xsinz is linear, with 
g(x) = 0 (for all x) and h(x) = xsin a. 

(c) The equation dz/dt = —3z'/? is not linear (because 
of the z!/? term). 

(d) The equation 7+ y? =t is not linear (because of 
the y? term). 

(e) The equation x(dy/dz) + y = y? is not linear (be- 
cause of the y? term). 

(f) The equation (1 + 2?)(dy/dx) + 2xy = 32? is lin- 
ear, since we can divide through by 1+ 2? to obtain 
dy/dx + 2xy/(1 + x”) = 327/(1+ 27), which is of the 
defined form with g(x) = 2x/(1+ 27) and A(x) = 
3a? /(1 +27). 


4.2 (a) The given equation is dy/dx — y = e*sinz. 
Comparison with Equations (4.1) and (4.9) shows that 
the integrating factor is 


p = exp (fo a) = exp(—2) = e7*. 


Multiplying through by p(x) gives 
——e “y=sing. 
Thus the differential equation can be rewritten as 
d 
ae = sine. 
On integrating, we find the general solution 

e “y=—cosx+C, 
or, equivalently, 

y =e" (C —cosz), 
where C is an arbitrary constant. 
(b) The given equation, when rearranged into 
form (4.1), is dy/dz —y =. This has the same left- 
hand side as the differential equation in part (a), and 
hence the same integrating factor, p = e~*. Multiplying 
through by p(2) gives 


beet’ ald _ e” = re”. 
dx 2 
Thus the differential equation can be rewritten as 
d 
—(e *y) =a2e”. 


dx 
On integrating (by parts on the right-hand side), we 
find 


e "y= fae da 
=-—re 7+ / e "dx 


=-re”—-e "+C 
=C-(a#2+1)e’, 
where C is an arbitrary constant. After multiplying 
through by e”, the general solution in explicit form is 
y =Ce® —(x#+1). 
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4.3 (a) The given equation, when rearranged into 
form (4.1), is du/dx — xu =0. The integrating factor is 


me (ge) 


= exp(—2"/2) 
=e /2, 
Multiplying through by p(x) gives 


ena /2du — ge~* /2y, = 0, 


da 
Thus the differential equation can be rewritten as 
d 
—(e? =0. 
da 


On integrating, we find the general solution 

en? ty = C, or, equivalently, u= Cet /?, 
where C is an arbitrary constant. 
From the initial condition u(0) = 2, we have 2 = Ce®, so 
C = 2. Hence the solution of the initial-value problem 
is 


u = Qe? /?, 


(b) After division by ¢, the given equation can be writ- 
ten as dy/dt + (2/t)y =t. (To avoid division by zero, 
we take t > 0, say, which is consistent with the initial 
condition.) The integrating factor is 


cy 


= exp(2Int) 
= exp(In(??)) 
=. 
Multiplying through by p(t) gives 
ady 
dt 
Thus the differential equation can be rewritten as 


P— 4+ Qty = #3. 


d 142 3 
—(t =f. 
ae» 
On integrating, we find the general solution 

ty = st! +C, or, equivalently, y= $0? +Cr?, 
where C is an arbitrary constant. 
From the initial condition y(1) = 1, we have 1 = $+C, 
so C= 3, Hence the solution of the initial-value prob- 
lem is 


y= 4(? + 3t7’). 


4.4 (a) and (d) require the integrating factor method. 
(b) is best solved by direct integration. (c) can be 
solved by separation of variables or the integrating fac- 
tor method. 
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4.5 (a) The given equation is dy/dt + y = t+ 1. Com- 
parison with Equations (4.1) and (4.9) shows that the 
integrating factor is 


to 


= exp(t) 
=e 
Multiplying through by p(t) gives 
dy 
t 


—+ey=(t+ le’. 
ently (t+ lL)e 


Thus the differential equation can be rewritten as 


< (ety) = (t+ Det 


On integrating (by parts on the right-hand side), we 
find 


ey = Kc + 1)e' dt 


= (t+ 1)e* -fe dt 


=(t+lje’-e' +C 

= te’ +C, 
where C' is an arbitrary constant. After multiplying 
through by e~*, the general solution in explicit form is 

y=Ce* +t. 

From the initial condition y(1) = 0, we have 
0=Ce-!+1, so C =—e. Hence the solution of the 
initial-value problem is 


y=t—e . 
(b) After division by e** and rearrangement, the given 
equation becomes dy/dt + y = e~*". This has the same 
left-hand side as the differential equation in part (a), 
and hence the same integrating factor, p = e’. Multi- 
plying through by p(t) gives 


d 
oe + ely =e 7. 
Thus the differential equation can be rewritten as 
d 
Hey) =e", 


On integrating, we find the general solution 
e'y=—te *+C, 

or, equivalently, 
y= Ce*— he *, 

where C is an arbitrary constant. 

From the initial condition y(0) = 3, we have 


3 = Ce? — $e°, so C = §. Hence the solution of the 


initial-value problem is 


y = 3(7e*-e~*). 


4.6 (a) After division by x (where x > 0), the given 
equation becomes dy/dx — (3/x)y = 1. The integrating 
factor is 


—32*y=27%. 
dx e 
Thus the differential equation can be rewritten as 


d 
(0 8y) =a, 


On integrating, we find the general solution 
ge oy = -ta? +C, 

or, equivalently, 
y = Cx? — $2, 

where C is an arbitrary constant. 


(b) The given equation is du/dt + 4v = 3cos2t. The 
integrating factor is 


rown(fos 


= exp(4t) 
ae, 


Multiplying through by p(t) gives 
d 
- + 4e*ty = 3e* cos 2t. 
Thus the differential equation can be rewritten as 


d 
ae %) = 3e* cos 2t. 


On integrating (using the hint for the right-hand side, 
with a = 4 and b = 2), we find 
ety = Ze (4cos 2t + 2sin 2t) + C, 
where C' is an arbitrary constant. After multiplying 
through by e~*, the general solution in explicit form is 
v = #(4cos2t + 2sin 2t) +Ce™. 


Solutions to the exercises 
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UNIT 3 Second-order differential 
equations 


Study guide for Unit 3 


This unit extends the ideas of Unit 2 from first-order differential equations to 
a particular type of second-order differential equation. This type of second- 
order differential equation has a variety of applications, some of which are 
considered later in the course. 


This unit requires no previous knowledge beyond that required for Unit 2, 
apart from some familiarity with complex numbers. The relevant material 
on complex numbers was revised in Unit 1 of this course. 


Sections 1 and 2 contain the most important material. 


The recommended study pattern is to study one section per study session 
and to study the sections in the order in which they appear. However, 
you may find that Sections 1 and 2 take you rather longer than Sections 3 
and 4, and you may wish to spread your study of Sections 1 and 2 over three 
sessions. 


Section 4 uses the computer algebra package for the course and is designed to 
help you to understand the nature of the solutions obtained in Sections 1-3. 
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Introduction 


Unit 2 introduced you to differential equations, and in particular to first- 
order differential equations that can be written in the form 


Such an equation is said to be of first order because it involves only the first 
derivative dy/dx of the function y = y(z). 


This unit considers second-order differential equations, that is, differential 
equations that involve a second (but no higher) derivative. Examples of 
second-order differential equations are 

2 2 

_ _ 3 +2y=4e” and sot +y’ sing = 27. 

As in the case of first-order differential equations, second-order differential 
equations, and in particular the derivatives in such equations, can be written 
in a variety of notations. For example, the second derivative of a dependent 
variable y with respect to an independent variable t (representing time) may 
be written as d?y/dt?, jj, y”, and so on. Also, as in the case of first-order 
equations, the dependent variable and sometimes the independent variable 
can be considered as functions, and the same symbol is frequently used for 
both the variable and the corresponding function. 


One particularly simple example of a second-order differential equation, with 
dependent variable s and independent variable ¢, is 

ds 

5; = 4, 0.1 

dt? (0.1) 
where a is a given constant. This equation can be solved by applying direct 
integration twice. One application gives 

ds 

dt 


where C is an arbitrary constant. Integrating a second time gives 


= fad=at+e, 


s= [oc + C) dt = fat? +Ct+D, (0.2) 


where D is another arbitrary constant. Equation (0.2) is the general solution 
of the second-order differential equation (0.1). 


Now, in Unit 2 you saw that the general solution of a first-order differential 
equation usually involves just one arbitrary constant. But here, even for 
such a simple second-order differential equation, the general solution involves 
two arbitrary constants (namely C and D). It is a property of second-order 
differential equations that the general solution usually involves two arbitrary 
constants. 


The remainder of the unit proceeds as follows. Section 1 concentrates on 
homogeneous linear constant-coefficient second-order differential equations, 
leaving inhomogeneous equations to Section 2. Section 3 considers the types 
of condition needed to move from a general to a particular solution. Finally, 
Section 4 uses the computer to examine the nature of solutions. 
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The order of a differential 
equation was defined in 
Subsection 1.2 of Unit 2. 


A second-order differential 
equation may or may not 
include a first derivative. 


Of course, the dependent 
variable is not always y! 


See Subsection 3.1 of Unit 2. 


Recall, from Unit 2, that the 
general solution of a 
differential equation is the 
collection of all possible 
solutions of that equation. 


Section 1 Homogeneous differential equations 


1 Homogeneous differential equations 


After a short introduction in Subsection 1.1, Subsection 1.2 shows how ho- 
mogeneous second-order differential equations can be solved. Subsection 1.3 
explains why the solutions thus obtained are indeed the required general 
solutions. 


1.1 First thoughts 


You will recall that a particular solution of a first-order differential equation 
is obtained by applying a single condition (known as an initial condition) to 
the general solution in order to find a particular value of the single arbitrary 
constant. In the case of a second-order differential equation, a particular 
solution is obtained by applying two conditions to the general solution in 
order to find particular values of the two arbitrary constants. The following 
example illustrates this. 


Example 1.1 


Suppose that a car is travelling with constant acceleration a along a straight 
road. If, at time ¢t, its distance from a fixed point is s, then its velocity 
is given by ds/dt, its acceleration is given by d?s/dt?, and its motion is 
modelled by 
ds 
dt? 
If the car is initially stationary at position s = 0 and thereafter has a con- 


stant acceleration of 2ms~?, how long does it take for the car to attain a 
velocity of 30ms~!, and what distance has it travelled in that time? 


=a, (0.1) 


Solution 


You saw in the Introduction that integrating Equation (0.1) leads to 


d 
qaate and s = 5at?+Ct+D, 


where C and D are arbitrary constants. To find these constants (and hence 
answer the questions asked), we need to make use of the conditions given. 
These are that the car is initially stationary (i.e. ds/dt = 0 when t = 0) at 
position s = 0 (i.e. s = 0 when t = 0). The first of these conditions, together 
with the equation ds/dt = at + C, tells us that C= 0. With C = 0, the 
second equation becomes s = sat? + D, and this together with the second 
condition tells us that D = 0. 


Therefore, when a = 2, we have 


ds ds 9 

= =2, —=2t = te 

dt? > dt os 
So the velocity is ds/dt = 30 when 2¢ = 30, i.e. after 15 seconds, and in this 
time the car has travelled s = 15? = 225 metres. I 


The solution of second-order differential equations is rarely as easy as the 
solution of Equation (0.1) above. In fact, the approach of repeated direct 
integration works for only some equations of the form 

d?y 

—-9 = f(z). 


dx? 
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Most second-order differential equations cannot be solved by analytic meth- 
ods at all, and numerical methods have to be employed instead. However, 
there is one important class of second-order differential equations that can 
be solved by analytic means: this is the topic of this unit, and we introduce 
it next. 


Linear constant-coefficient differential equations 


This unit considers linear constant-coefficient second-order differential equa- 
tions. But what exactly do the terms ‘linear’ and ‘constant-coefficient’ mean 
in this context? 


The answer lies in the following definitions. 


Definitions 
(a) A second-order differential equation for y = y(«) is linear if it can 


be expressed in the form 


d? d 
a(x) 5 + W(x) = f(2), 


where a(x), b(x), c(x) and f(a) are given continuous functions. 


+ e(x)y = 


A linear second-order differential equation is constant-coefficient 
if the functions a(x), b(a) and c(z) are all constant, so that the 
equation is of the form 

d?y dy 
ag tbe + ey = fle x), 
where a 4 0. 


(c) A linear constant-coefficient second-order differential equation is 
said to be homogeneous if f(x) = 0 for all 2, and inhomoge- 
neous (or non-homogeneous) otherwise. 


(1.1) 


Linear constant-coefficient second-order differential equations can be written 
in other ways. For example, we can divide Equation (1.1) through by a to 
obtain an “ue of the form 

dy 


oe + J 4 ay = 6(2), 


and this more closely resembles the definition of linear first-order differential 


equations from Unit 2. 


*Exercise 1.1 


Consider the following second-order differential equations. 


, ay d’y dy aay ay dy 
. Py dy ; d?y dy 
2 
(iv) ay" +2°y=0 (vy) ft grea. ayes +4y = 32 
dt 
(vii) 2 77 + af +4t=sin@ (viii) ¢=—4t (ix) = —4er 


(a) Which of the equations are linear and constant-coefficient? 
(b) Which of the linear constant-coefficient equations are homogeneous? 


(c) For each equation, identify the dependent and independent variables. 
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Such numerical methods are 
discussed in Unit 26. 


You met the idea of a linear 
first-order differential 
equation in Unit 2. 


Compare the definitions for 
first-order equations in 
Subsection 4.1 of Unit 2. The 
important feature is the 
linear combination of y and 
its derivatives on the 
left-hand side. 


If a = 0, then the equation is 
first-order. 


Section 1 Homogeneous differential equations 


One of the main reasons for concentrating on linear constant-coefficient dif- 
ferential equations is that there is a large body of theory upon which we can 
call in order to solve them. The next subsection illustrates this. 


The principle of superposition 


A key theoretical result will turn out to be extremely useful throughout this 
unit. This is known as the principle of superposition, and is the fundamental 
property of linear differential equations. 


Suppose that we have a solution yi(a) of 


d’y dy 
ag tb + ey = file Bs 
and a solution yo(x) of 
d?y dy 
a + b + cy = fo(a ). 


Then we claim that the linear combination kyy; + kayo, where k; and kg are 
constants, is a solution of 


ty dy 

eee 7 toe + cy = ky fi(x) + ke fo(z). (1.2) 
In fact, this is easy to see, for if we substitute kyy, + keye directly into 
Equation (1.2), we obtain 


2 


d 
as (kin + kaye) + ae (kiyn + kaya) + c(kiys + oye) 


fg d? d d 
=a (ut +n?) +b (nH + ky *) +c¢(kiyi + kayo) 


d? d a d 
= by (a Te + 0 + en) + (052 +02 + on] 
dx? d x x 


= ky fi(z) + ke fo(z), 
as required. 


We summarize this important result as a theorem. 


Theorem 1.1 Principle of superposition 


If yi(x) is a solution of the linear second-order differential equation 


at dy “ 
and sol Jisa ei of the linear second-order differential equation 
d*y 


dy 
aay tbe + ey = fale x) 


(with the same left-hand side), then the function 
y(x) = kiys(@) + keyo(a), 


where k, and ko are constants, is a solution of the differential equation 


dy d 
a2 7 tbe + cy = ky f(x) + ke fo(a). 


Here a, b and c can be 
functions of x. 
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1.2. Method of solution 


This subsection develops a method for solving homogeneous linear constant- 
coefficient second-order differential equations, i.e. equations of the form 


d’y dy 
aT aes +cy =0, (1.3) 


where a, b, c are constants and a 4 0. 


To see how this method arises, consider the first-order differential equation 


dy 
as —0 1.4 
nt Y= (1.4) 


where b and © are constants and b 4 0. This is a homogeneous linear equa- 
tion; as can be shown using the integrating factor method from Unit 2, this 
has a general solution of the form y = Ae**, where A is an arbitrary con- 
stant and A is some fixed constant. To find A, we could solve the equation 
as in Unit 2; alternatively, we can substitute y = Ae*” into Equation (1.4). 
Then we have dy/dx = \Ae**, and so 


po + cy = b\Ae** + cAe*® = (b’ + c) Ae”. 


Therefore, for y = Ae** to be a solution, (b\ + c)Ae*” must be zero, for 
all x. Since A is arbitrary and e** > 0, for all «, we must have bA+c=0, 
ie. A=—c/b. 


This useful idea of substituting y = Ae*” as a trial solution can be applied 
to Equation (1.3) as well. Let us suppose that Equation (1.3) has a solution 
of the form y = Ae**, for some value of \. If so, then dy/dx = \Ae** and 
d’y/dx? = ? Ae”, and substituting into the left-hand side of Equation (1.3) 
gives 


dy od 
J pe cy = ad? Ae™ + bAAe™ + cAe™ 


“da? da 
= (ad? + bA +c) Ae**. 


Hence y = Ae** is indeed a solution of Equation (1.3), for any value of A, 
provided that X satisfies 


a? +bA+c=0. (1.5) 


Equation (1.5) plays such an important role in solving linear constant- 
coefficient second-order differential equations that it is given a special name. 


Definition 


The auxiliary equation of the homogeneous linear constant-coefficient 
second-order differential equation 


dy dy 
ly a = 
a cf) TF ae + cy =0 
is the quadratic equation 
ad’? + bA+e=0. (1.5) 


The auxiliary equation is obtained from the differential equation by replacing 
d d? 

y by 1, me by A, and ae by ?. 
dx dx? 
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Note that the discussion here 
applies irrespective of 
whether 4 is real or complex. 
The consequences of A being 
complex are explained later. 


The auxiliary equation is 
sometimes called the 
characteristic equation. 


Section 1 Homogeneous differential equations 


Example 1.2 
Write down the auxiliary equation of the differential equation 
d’y dy 
3— —2— + 4y =0. 
dx? dx = 
Solution 


The auxiliary equation is 


3\7-2\+4=0. 


*Exercise 1.2 


Write down the auxiliary equation of each of the following differential equa- 


tions. 
d? d 
(a) F5-55-+6y=0 — (b) y’-9y=0 (ce) €4+2e=0 


Now, so far, we know that y = Ae** is a solution of Equation (1.3) pro- 

vided that A satisfies its auxiliary equation. But the auxiliary equation is 

a quadratic equation with real coefficients, and so has two roots (which in The roots of a quadratic 

general are distinct). These two roots, 1 and 2 say, give two solutions equation were discussed in 

yy = Ce™* and yo = De*2* of Equation (1.3), where C and D are arbitrary Unit 1. 

constants. If Ay = Ae, then we obtain 
only one solution. This case is 


Example 1.3 dealt with separately below. 


(a) Write down the auxiliary equation of the differential equation 


and find its roots Ay and Ag. 


(b) Deduce that y, = Ce” and y2 = De?” are both solutions of the differen- 
tial equation, for any values of the two constants C' and D. 


(c) Show that y = Ce® + De?* is also a solution of the differential equation, 
for any values of the two constants C’ and D. 


Solution 


(a) The auxiliary equation is 


MW — 3A +2=0. 
This equation may be solved, for example, by factorizing in the form Using the formula 
(A — 1)(A — 2) = 0, to give the two roots A; = 1 and Ag = 2. ee —pita/§? hae 
1,A2 = ——3- 


. = _ ye . a x 2a 
(b) Since \y = 1 and Az = 2 are the roots of the auxiliary equation, y; = Ce ‘st linia Glin uate aR: 


and y2 = De?* are solutions of the differential equation, for any values Fe doce wot wincear-e cian 


of C and D. the roots is called A; and 


(c) To show that y = Ce® + De?* is a solution of the differential equation, wht cle 


we differentiate and substitute into the differential equation. Differen- 
tiating to obtain the first and second derivatives of y gives 


d 
7 = Ce® + 2De* 
r 
and 
2 
“ = Ce® + 4De** 
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Substituting these into the left-hand side of the differential equation 


gives 
d*y dy 
ge 


= (Ce* + 4De?*) —3 (Ce® + 2De”") +2 (Ce® + De**) 
= C(1—3+2)e" + D(4—6 + 2)e* 
= 0. 


Hence y = Ce” + De?” is a solution of the differential equation, for any 
values of Cand D. Hi 


In Subsection 1.3 we shall prove that if A; and Ag are distinct roots of the 
auxiliary equation of a homogeneous linear constant-coefficient second-order 
differential equation, then any solution is of the form 


y = Ce™” + De*, (1.6) 


for some choice of constants C and D. 


*Exercise 1.3 


Use the auxiliary equation to find the general solution of each of the following 
differential equations. 

dy dy d?y dy d7z 
(a) 


245 -Gj=0 “(b) 2-243. =0 


dx? dx dx? daz: (¢) du2 meee 


We now consider an example where the two roots of the auxiliary equation 
are equal, in which case the above recipe does not work! Indeed, in light 
of the earlier discussion, you might expect the solution always to be of the 
form y = Ae*!* + Be*2*, where A and B are arbitrary constants. But if 
dt = Az, this reduces to y = (A+ B)e** = Ce”, where C= A+ Bisa 
single arbitrary constant, so this cannot be the general solution of a second- 
order differential equation. 


Example 1.4 

(a) Write down the auxiliary equation of the differential equation 
dy dy 

dee + Oa + 9y = 0, 

and find its roots A; and Ad. 


(b) Deduce that y; = Ce~** is a solution of the differential equation, for 
any value of the constant C. 


(c) Show that yo = Dxe~** is also a solution, for any value of the con- 
stant D. 


(d) Deduce that y = (C + Dx)e~** is also a solution of the differential equa- 
tion, for any values of the two constants C and D. 


Solution 


(a) The auxiliary equation is 


M+ 6A+9=0. 
The left-hand side is the perfect square (A + 3)?, so the auxiliary equa- 
tion has equal roots Ay = A2g = —3. 
(b) Since \y = —3 is a root of the auxiliary equation, y, = Ce~** is a solu- 


tion of the differential equation, for any value of C. 
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Note that the ‘other’ root 
Az = —3 gives the same 
solution. 
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(c) To show that yo = Dre~* is a solution of the differential equation, we 
differentiate and substitute into the differential equation. Differentiating 
to obtain the first and second derivatives of yo gives 


d 
_ = De*" + Da (—3e7°") = Dili —82)e"*, 
ie 

tye _ _3pe-% 4 D(1 — 32)(—3e-8*) = D(—6 + 9u)e“* 

aa? = ~8De + D(1 — 3x)(—3e7°*) = D(-6 + 9x)e~**. 
Substituting these into the left-hand side of the differential equation 
gives 

d?ys dy2 = —32x —32 —32 

—~= + 6— + 9yp = D(-6 + 9x)e""* + 6D(1 — 8x)e"** + 9D xe 

dx? dx 


= D(-6 + 6)e"** + D(9 — 18 + 9)xe“** 
=0. 


Bi 


Hence y2 = Dre~** is a solution of the differential equation, for any 


value of D. 


(d) Since y, = Ce~** and yo = Dxe~*" are both solutions of the differential 
equation, the principle of superposition (Theorem 1.1) tells us that so is 
y = Ce + Dare~** = (C + Dax)e~**, for any values of C and D. 


The solution in Example 1.4 is of the form y = Ce™!* + Daxe*!*. The extra x 
in the second term, De, is needed, in this special case, to incorporate the 
second arbitrary constant required by the general solution of a second-order 
differential equation. 


In general, when Ay = Az, y = xe! 


To see this, differentiate twice to obtain 


d 

oY _ pdt 4 ive!” = (14 Apz)e™*, 

dx 

Ot o%, AMP 4 Ny (1+ Aywje™™ = (2A, + Aja)e™™™ 

dee = Aje 1 1v/)E = 1 trie ; 

and substitute into the left-hand side of Equation (1.3) to obtain 

@y dy 
ol ay td 

da? us dx ee 


=a ((2a1 + Xinje™*) +b ((1 + Aizje™*) +¢ (we) 
ent (a(2A, + Nix) + b(1 + Aix) + cx) 
= e™ ((2a\ +b) + (@A7 + DAY + Ga). (1.7) 


Since A, is the solution of the auxiliary equation, we have ant +bd\; +c=0. 
Also, the formula method for solving the auxiliary equation a\? + b\ +c =0 
gives 


—b+ Vv b2 = 4ac 

2a : 
since in this case we have equal roots, we must have b? — 4ac = 0, so 
A, = —b/2a, and therefore 2a\; + 6 = 0. Thus the right-hand side of Equa- 
tion (1.7) is zero, and y = xe*!” is indeed a solution of Equation (1.3). 
Therefore, when A; = A2, by the principle of superposition, 


y = Ce” + Dre™® = (C+ Dr)e™*, (1.8) 


j= 


where C’ and D are arbitrary constants, is always a solution of Equa- 
tion (1.3). In fact, as you will see in Subsection 1.3, Equation (1.8) gives 
the general solution. 


* is a solution of Equation (1.3) (page 114). 


Here we are using the Product 
Rule for differentiation. 
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*Exercise 1.4 


Use the auxiliary equation to find the general solution of the following dif- 
ferential equations. 


(b) §$-48+4s =0 


Equations (1.6) and (1.8) give us the general solution of Equation (1.3) for 
the cases where the roots A; and 2 of the auxiliary equation are distinct 
and equal, respectively. However, the distinct roots of a quadratic equation 
may not be real — they could consist of a pair of complex conjugate roots 
Ay = a+ fi and Az =a — (i. If the auxiliary equation has such a pair of 
roots, we can still write the general solution in the form 
y= Ae™* + Ber _ Aclot Bix es Belo Fix, 
but we now have a complex-valued solution. 
Since Equation (1.3) has real coefficients, we would like a real-valued solu- 
tion. In order to achieve this, we shall need to allow A and B to be complex. 
Then we can use Euler’s formula, which tells us that 
e'F® — cog Bx +isinBe and e~*” = cos Bx — isin Be. 
Now 
y = Ae” + Be2* 
= Aelat Bix + Belo-Fie 
_ Ae eibx a Bet e7ib= 
= Ae” (cos Gx + isin Br) + Be (cos Gx — isin Gr) 
= e®((A+ B) cos Gx + (Ai — Bi) sin Br) 
= e®" (C'cos Ba + Dsin Br), 
where C = A+ B and D=(A-— B)i. Provided that any initial conditions 


are real-valued, C and D are real, and this is the required real-valued solution 
containing two arbitrary constants. 


Example 1.5 
(a) Write down the auxiliary equation of the differential equation 
dy dy 
ot 26 + 137 =0 
de? dx 4” 
and show that its roots are \y = 3+ 27 and Ag = 3 — 2i. 


3 


(b) Confirm that y= e?* cos 2a and yo = e°” sin2x are both solutions of 


the differential equation. 


(c) Deduce that y = e**(C cos 2x + Dsin2z) is also a solution of the differ- 
ential equation, for any values of the two constants C' and D. 


Solution 
(a) The characteristic equation is 
7 —6\+ 13 =0. 
The formula method gives 


61+/36-4x1 x 13 _ 6+ 
2 7 2 
so the two complex conjugate roots are A; = 3+ 27 and Ag = 3 — 2i. 


—16 


A= = Ose 20, 
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Recall that the complex 
conjugate of the complex 
number a+ (i is a — (i. 


You will soon see why we use 
A and B for the arbitrary 
constants (rather than our 
usual choice of C and D). 


Euler’s formula was discussed 
in Unit 1. 


The constants in the final 
expression are now C' and D, 
in keeping with our previous 
solutions. 


With the previous notation 
we have a= 3 and G=2. 
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(b) To confirm that y,; = e*” cos 22 is a solution of the differential equation, 
we differentiate and substitute into the differential equation. Differen- 
tiating to obtain the first and second derivatives of y; gives 


d 
_ = 3e?* cos 2x + eP* (9 sin 22) 
x 
= e**(3 cos 2x — 2sin 22), 
d2 


7a = 3e°*(3 cos 2x — 2sin 2x) + e®*(—6 sin 2x — 4cos 22) 
x 


= e**(5 cos 2a — 12sin 22). 
Substituting these into the left-hand side of the differential equation 
gives 


a? d 
alte ee 13y1 = e°"(5 cos 2x — 12 sin 2x) 


aa? d 
e . — 6e** (3 cos 22 — 2sin 2x) + 13e°” cos 2x 
= e** [(5 — 18 + 13) cos 2a + (—12 + 12) sin 22] 
= 0. 


3 


Hence y, = e°*” cos 2 is a solution. 


Similarly, for y2 = e** sin 2x we have 
d 
oe ce?" (2 cos 2x + 3sin 22), 
dx 
d2 
ne — e®*(12cos 2x + 5sin 22), 


and substituting into the left-hand side of the differential equation gives 


e?” (12 — 12) cos 2x + (5 — 18 + 13) sin 2z] 
=0. 


| 
aD 
— 
hb 
aw 
nN 

I 


3 


Hence yg = e°” sin 27 is also a solution. 


(c) Since y, = e?* cos2x and yo = e®” sin 2x are both solutions of the dif- 
ferential equation, the principle of superposition (Theorem 1.1) tells us 


that so is 


y = Ce** cos 2x + De® sin 2x 
= e**(Ccos 2 + Dsin2z), 


for any values of C and D. Hi 


*Exercise 1.5 


Use the auxiliary equation to find the general solution of each of the following 
differential equations. 
dy dy 
—5+4—+8y=0 
ge ae 
d?0 
b) — = 
(b) qe + 96 = 0 


We now summarize the method of solving these differential equations as a 
procedure. 


119 


Unit 3 Second-order differential equations 


Procedure 1.1. The solution of homogeneous linear constant- 
coefficient second-order differential equations 


The general solution of the homogeneous linear constant-coefficient 
second-order differential equation 

d? d 

faa fd 

dx? dx 


where a, b, c are (real) constants and a 4 0, may be found as follows. 


+cy =0, (1.3) 


(a) Write down the auxiliary equation 
ad? + brA\ + e=0, (1.5) 
and find its roots A; and Ag. 


(b) (i) If the auxiliary equation has two distinct real roots 1 and Ag, 
the general solution of the differential equation is 


y = Ce™* + De®. 


(ii) If the auxiliary equation has two equal real roots 
Ay = 2, the general solution of the differential equation is 


y= (C+ Da)e™*. 


(iii) If the auxiliary equation has a pair of complex conjugate roots 
Ay = a+ Gi and Ag = a — (i, the general solution of the differential 
equation is 


y = e*" (C cos Bx + Dsin Ba). 


In each case, C' and D are arbitrary constants. 


It is worth noting that the three cases in part (b) of Procedure 1.1 corre- 
spond to three different possibilities that arise when solving the characteris- 
tic equation ad? + bA +c = 0. These three different possibilities relate to the 
value of the discriminant b* — 4ac: b? — 4ac > 0 corresponds to case (i), 
b? — dac = 0 to case (ii), and b? — 4ac < 0 to case (iii). 


*Exercise 1.6 
Find the general solution of each of the following differential equations. 
dy 


(b) u(x) — 6u'(x) + 8u(ax) = 0 


dy dy 
ee Oe, 
(c) dx? 7 dx : 
d’y dy 
8) G2 ay t= 9 
dy 
e) —+ —w?y =0, where w is a real constant 
dx? 


d*y dy 
f) — td a n= 0 
(f) 72 t Fes Oy 
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Exercise 1.7 


Small oscillations of the pendulum of a clock can be modelled by the differ- 
ential equation 


- a _§ 
d= 0, 7 
where g is the magnitude of the acceleration due to gravity, | is the length : 
of the pendulum, and @ is the angle the pendulum makes with the vertical 
(see Figure 1.1). Solve the differential equation to obtain an expression for 
@ in terms of g and I. 
1.3 The general solution 
Figure 1.1 
In this subsection we prove that the solutions we discovered in Subsection 1.2 
are indeed the most general solutions of Equation (1.3). This proof is in- 
cluded for completeness, and you will not be expected to reproduce it; how- 
ever, it does provide some useful revision of the integrating factor method 
from Unit 2. 
Theorem 1.2 
Suppose that the roots of the equation 
ad? +bA+c=0 
are Ay and Ag. Then the general solution of the second-order linear 
constant-coefficient differential equation 
d’y , dy 
oe pe = 1.3 
aT Si da + cy (1.3) 
can always be written in the form 
( Cem= + De2* (Ay t d2); 
‘a 
# (C+ Dz) (4 ==). 
In order to prove this it will be convenient to divide through Equation (1.3) 
by a, obtaining the equivalent form 
d*y bdy c¢ 
= 0. 1.9 
dx? . adz ‘ a (1.9) 
Now we resort to a trick, in order to obtain a first-order differential equation 
d 
in w= a — ky, where k is a constant yet to be determined. 
x 
d d? d 
Noting that lager dey | a Equation (1.9) can be written first as 
dx dx? dx 
dw b dy oc Note that we have eliminated 
dx a dx a the ae term. 
Xv 


d 
and then, substituting iL =w+ky, as 
x 


b 
m+ (248) (w + ky) + <y =0. 
x a a 


Simplifying, this yields 


d b b 
v4 (Fak)we(S+e(2 +e) )y=0 
dx a a a 
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Now, choosing k to be a root of the quadratic equation 
b c 
iP +okt+—=0, 
a a 


which simplifies to 


ak? + bk +c=0, (1.10) 
we indeed arrive at a first-order differential equation for w: 

ee (2+) w=0. (1.11) 

da a 


Of course, Equation (1.10) is the auxiliary equation associated with Equa- 
tion (1.3), with roots 


b+ V/b? — 4ac 

2a , 
so these are the values of k that we may choose. Before proceeding, notice 
that the sum of the two roots satisfies 


b 
Ay + Ag = =S 


A1, A2 = — 


For the sake of definiteness, choose k = A,. The above differential equa- 
tion (1.11) for w may therefore be written as 


e+ (2+a)w=0. 


dx 
But since A; + A2 = —b/a, we have 
b 
— +A, = Ad. 
a 
Thus Equation (1.11) becomes 
dw 
— — ww =0. 
da sa 


This is a first-order linear differential equation, and in principle we could 
solve it using the integrating factor technique from Unit 2. However, the 
equation is of a particularly simple form that we have seen before, so we can 
write down the solution as 


w= Aer”, 


where A is an arbitrary constant. Now, remembering that 
dy dy 


= n= d 
ded ae ae 
we arrive at a first-order differential equation for y: 
d ; 
°F _ \yy = Ae. (1.12) 
dx 


This is a non-homogeneous linear first-order differential equation that we 
can solve, once again using the integrating factor method of Unit 2. The 
integrating factor is 


exp (/ — jy i) = ee, 


Multiplying through Equation (1.12) by this factor produces the equation 


oie Y _ Mey _ AeQ2—1)# 
dx 
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It does not matter which root 
we choose, since there is no 
particular significance in the 
labelling of A, and Ag. 
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or 

d —Ayx (A2—A1) 2 

Te (ve ) = Ae ; (1.13) 
We now have two cases to consider. If A2 — A, # 0, then integrating both This is the case when the 
sides yields auxiliary equation has 

A distinct roots. 
—A\ x _ (Ag—A1)2 
€ = ————e +C 


where C is an arbitrary constant. Multiplying through by e™”, we arrive at 
the explicit form of the general solution 


E27 4 CieAt, 


ey Ce © 
Finally, replacing the arbitrary constant A/(A2 — 1) by D gives the required 
result: 


y = Ce™* + De®. 


If Ay = Az = A, say, then Equation (1.13) becomes This is the case when the 
d auxiliary equation has equal 
ee (ye*”) -A roots. 
dx ; 


and integrating gives 
ye” = Ar +C, 


where C is an arbitrary constant. Re-labelling A as D, and multiplying 
through by e*”, we arrive at the explicit form of the general solution 


y= (C+ Dza)e. 
This completes the proof of Theorem 1.2. 


End-of-section Exercises 


Exercise 1.8 


(a) Write down the auxiliary equation of the differential equation 


(b) Solve this auxiliary equation. 


(c) Write down the general solution of the differential equation. 


Exercise 1.9 


Find the general solution of each of the following differential equations. 


d?y dy dy 
dy dy a6 do 


Exercise 1.10 
For which values of the constant k does the differential equation 


dy dy 
— + 4k— + 4y = 0 
dx? = dx oa 
have a general solution with oscillating behaviour, that is, a general solution 


which involves sines and cosines? 
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2 Inhomogeneous differential equations 


Section 1 was concerned with finding the general solution of homogeneous 
linear constant-coefficient second-order differential equations. This section 
concerns inhomogeneous linear constant-coefficient second-order differential 
equations, i.e. equations of the form 


dy _,dy 

b = ; 2.1 

da? + dx hie = i) 2) 

where a, b, c are real constants, a # 0, and f(x) is a given continuous real- 
valued function of x. 


Subsection 2.1 gives the general method for constructing solutions of Equa- 
tion (2.1). Subsection 2.2 shows how to find an appropriate particular 
solution of the differential equation, for use in constructing the general so- 
lution, in cases where the function f(x) takes one of a few particular forms. 
Subsection 2.3 deals with certain cases where complications can arise. Sub- 
section 2.4 shows how to deal with cases where f(x) is a combination of the 
functions discussed in Subsection 2.2. 


2.1 General method of solution 


The basic method used for finding the general solution of Equation (2.1) 
depends on the principle of superposition (Theorem 1.1), and is illustrated 
in the following example. 


Example 2.1 


Show that y = Ce~?* + De~*” + 2 is a solution of the inhomogeneous dif- 
ferential equation 


dy  _dy 
a2 tog, t by = 12 (2.2) 


for any values of the constants C and D. 


Solution 
We know from Exercise 1.3(a) that the homogeneous differential equation 
d’y dy 
5 6y = 0 2.3 
dx? * dx we (2.3) 
has a general solution ye = Ce~?” + De~**, where C and D are arbitrary 
constants. 


Now consider the constant function yp = 2. This is a particular solution of 
Equation (2.2) since d?y,/dx? = dyp/dx = 0 and 6yp = 12. 


Therefore, by the principle of superposition (Theorem 1.1), 
Y=YerYy = Ce + De 49 
is a solution of Equation (2.2), for any values of Cand D. 


Equation (2.3) is an example of an associated homogeneous equation — 
i.e. the homogeneous equation associated with the inhomogeneous equa- 
tion (2.2) by making its right-hand side zero. The solutions y, and yp also 
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The notation y, and yp will 
be explained shortly. 
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have special names in this context: y-, the general solution of the asso- 
ciated homogeneous equation (2.3), is called the complementary function, 
and yp, a particular solution of the inhomogeneous equation (2.2), is called 
a particular integral. 


Definitions 


Let 
d*y dy 
a— +b—+cy= f(a 
be an inhomogeneous linear constant-coefficient second-order differen- 
tial equation. 


(2.1) 


(a) Its associated homogeneous equation is 


dy dy 
a—, +b—+cy=0. 
da? dx | 
(b) The general solution y. of the associated homogeneous equation is 
known as the complementary function for the original inhomo- 


geneous equation (2.1). 


(c) Any particular solution yp) of the original inhomogeneous equa- 
tion (2.1) is referred to as a particular integral for that equation. 


Later in this section we shall show how to find particular integrals for a wide 
variety of equations. Before we do that, it is important to realize the full 
significance of finding just one particular integral. 


*Exercise 2.1 


Suppose that we have found two different particular integrals yp,, yp, for 
Equation (2.1). Use the principle of superposition to show that the function 
Yp, — Yp, is then a solution of the associated homogeneous equation. 


The result of Exercise 2.1 shows the true significance of finding a particular 
integral. For if we do so then, since from Section 1 we know how to solve 
the associated homogeneous equation, we can find all particular integrals 
simply by adding the complementary function. We have proved the following 
important result. 


Theorem 2.1 


If ye is the complementary function for an inhomogeneous linear 
constant-coefficient second-order differential equation, and yp is a par- 
ticular integral for that equation, then y. + yp is the general solution 
of that equation. 


Note that y., being the general solution of the associated homogeneous equa- 
tion, will contain two arbitrary constants, whereas y,, being a particular 
solution, will contain none. 


Let us now see how the method based on Theorem 2.1 can be applied. 


The term particular integral 
is used here, rather than the 
term particular solution used 
in some other texts, to 
distinguish it from the 
particular solution to 
Equation (2.1) that satisfies 
given initial or boundary 
conditions (see Section 3). 


That is, we can find the 
complementary function. 
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Example 2.2 
Find the general solution of the differential equation 

dy 

age + 9y = 9x + 9. (2.4) 
Solution 


The associated homogeneous equation is 


dy 
—~ + 9y =0 
dx2 =r Yy ? 


which has the general solution 


Yc = Ccos 3a + Dsin32. See Exercise 1.5(b), although 
i. : : there different symbols were 
This is the complementary function for Equation (2.4). used for the variables, 
A particular integral for Equation (2.4) is You will see in the next 
subsection how to find such a 
Y =xr+l. particular integral. 


This may be verified by differentiation and substitution: y/, = 1 and v5 = 0, 
and substituting into the left-hand side of Equation (2.4) gives 


py + 9p =04+9(e +1) = 9x +9, 
which is the same as the right-hand side of Equation (2.4), as required. 
The general solution of Equation (2.4) is, therefore, by Theorem 2.1, 
Y=Vetyp = Ceosd2+Dsngrc+2+1, 


where C and D are arbitrary constants. MH 


The method of Example 2.2 may be summarized as follows. 


Procedure 2.1. |The solution of inhomogeneous linear constant- 
coefficient second-order differential equations 


To find the general solution of the inhomogeneous linear constant- 
coefficient second-order differential equation 


dy dy 
adit Ag = 
da? * dx Pe Im) 
(a) find its complementary function yc, i.e. the general solution of the The reason why y, is found 
associated homogeneous differential equation first will become clear in 
a2 F Subsection 2.3. 
7] 7] 
we p= 0, 
da? 7 dx rey 


using Procedure 1.1; 
(b) find a particular integral yp. 
The general solution is y = Yc + Yp. 


It is worth noting that, by Theorem 2.1, any choice of particular integral 
in Procedure 2.1 gives the same general solution. The formula obtained 
for the general solution may look different for different choices of particular 
integral, but they are in fact always equivalent. For example, in Example 2.2 
the particular integral y, = «+ 1 was chosen, and the form of the general 
solution was obtained as y = Ccos3a+ Dsin3x2+a2+1. It would have been 
equally valid to have chosen, as a particular integral, y, = x + 1+ sin3z. 
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In that case, the form of the general solution would have been obtained as 
y = Ccos3xa+ Dsin3xz+2+1+sin3z. This looks a little different, but it 
may be written in the form y = Ccos3z + (D+ 1)sin3z +a +1; and, since 
C and D are arbitrary constants, this form of the general solution represents 
exactly the same family of solutions. 


*Exercise 2.2 


Consider the following differential equations. 
a d? d 

(a) 4+4y=8  (b) ais aE ep 2G See Exercise 1.6(a) and 
dx dx dx Example 1.3. 

For each equation: 


e write down its associated homogeneous equation and its complementary 
function Ye} 

e find a particular integral of the form y, = p, where p is a constant; 

e write down the general solution. 


When using Procedure 2.1, the complementary function is found by using 
Procedure 1.1. However, the procedures for finding a particular integral are 
another matter. In Exercise 2.2, where the right-hand sides of the equa- 
tions are constants, it was possible to find a particular integral almost ‘by 
inspection’; but this method is generally inadequate. Fortunately, there ex- 
ist procedures for finding a particular integral for equations involving wide 
classes of right-hand-side functions f(a). The remainder of this section con- 
siders some of the simpler cases, where it is possible to determine the form of 
a particular integral by inspection, although some manipulation is required 
in order to determine the values of certain coefficients. 


2.2 Finding a particular integral by the method of 
undetermined coefficients 


In the previous subsection you saw that the inhomogeneous linear constant- 
coefficient second-order differential equation 
d? d 

ag tb= + ey = fe) (2.1) 
can be solved by first solving the associated homogeneous equation, using 
the methods of Section 1, and then finding a particular integral of Equa- 
tion (2.1), which depends upon the function f(z). This and the next two 
subsections show you how to find a particular integral when f(a) is a poly- There exist procedures for 
nomial, exponential or sinusoidal function, or a sum of such functions. finding a particular integral 

for fairly general types of 

You saw an example of the approach in Exercise 2.2. There the functions continuous function f(x), but 
f(a) were constants and you tried a constant function y = p as a particular these are not considered in 
integral, substituting into the differential equation to find a suitable value this course. 
for p. In general, we try a function of the same form as f(x) as a partic- 
ular integral, and substitute into the differential equation to find suitable 
values for its unknown coefficients. The function that we try is known as a 
trial solution, and the method is known as the method of undetermined 
coefficients. 


The following examples illustrate the method. Bear in mind, though, that 
the method (and hence these examples) finds only a particular integral for 
the differential equation; to find the general solution you would need to find 
the complementary function and combine it with the particular integral, 
according to Procedure 2.1. 
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al 


A polynomial function (f(x) = myx" + mp_12" + +--+ +m x2 + mo) 


Let us first consider a case where f(x) is a linear function (i.e. a polynomial 
of degree 1). 


Example 2.3 
Find a particular integral for 
dy dy 
3— -— 2— = 4742. 
de? “dx | 9" 
Solution 


We try a solution of the form 


Y= pix + po, 


where p; and po are coefficients to be determined so that the differential 
equation is satisfied. To try this solution, we need the first and second 
derivatives of y: 


dy _ dy _¢ 
de? Ga? — 
Substituting these into the left-hand side of the differential equation gives 
d? d 
3-5 —2F +y=3 x 0 — 2p; + (pit + po) = pix + (po — 2p1). 


Therefore, for y = pjx + po to be a solution of the differential equation, we 
require that 


pix + (po — 2p;) =4"+2 for all x. (2.5) 


To find the two unknown coefficients p; and po, we compare the coefficients 
on both sides of Equation (2.5). Comparing the terms in x gives p; = 4. 
Comparing the constant terms gives po — 2p; = 2, so that pp = 24+ 2p, = 
2+2x4= 10. Therefore we have the particular integral 


Yp = 4x + 10. 


Checking: if y= 4x2 +10, then dy/dx = 4, d*y/dx? = 0, and substituting 
into the left-hand side of the differential equation gives 
d*y dy 
372 egg TYR SKO-2x 4+ (40 + 10) = 4g + 2, 


as required. I 


You will have noticed in Example 2.3 that substituting a linear trial solution 
y = pix + po into the left-hand side of the differential equation resulted 
in a linear function, namely p,x + (po — 2~p1), whose coefficients could be 
compared with those of the linear target function 4% + 2 to obtain values 
for py and po. This is really the key to the method. If the target function 
is linear, choosing a linear trial solution ensures that substituting into the 
left-hand side of the differential equation results in a linear function whose 
coefficients can be compared with those of the target function. Similarly, 
as you will see below, if the target function belongs to certain other classes 
of functions, choosing as a trial solution a general function from that class 
ensures that substitution into the left-hand side of the differential equation 
produces another function from the same class, whose coefficients can be 
compared with those of the target function, thus enabling values to be given 
to the coefficients of the trial solution. The method will work provided that 
all the derivatives of functions in the class are also in the class. 
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Comparing coefficients works 
because two polynomials are 
equal if and only if all their 
corresponding coefficients are 
the same. 


Section 2 Inhomogeneous differential equations 


*Exercise 2.3 


Find particular integrals of the form y = pix + po for each of the following 
differential equations. 
d?y dy dy 


dy 


Note that, in Exercise 2.3(b), although f(a) is just a multiple of xz, it is not 

possible to find a solution of the form y(x) = pix. It is necessary for the Try it and see what goes 
trial solution to contain terms like those in f(a) and all its derivatives, so wrong. 

that in this case the trial solution must be of the form y = p1x+ po. So, 

in general, even if mp = 0 in f(x) = m,x+ mo, so that f(x) = m2, you 

should use a trial solution of the form y = pix + po. 


However, if f(z) = mo is a constant function, then the trial solution need You saw examples of this in 
only be a constant function y = po. Exercise 2.2. 


In general, if f(z) = mpx™ + mp_i2™ 14+---+m,2+ mo, where my, £# 0, 
then a trial solution of the form y = pynx” + pn_y2"—! +--+ + py2+ po should 
be used. 


*Exercise 2.4 


Find a particular integral for 


y’ —y=t?. 


An exponential function (f(x) = me**) 


Example 2.4 
Find a particular integral for 
*y 3 
7) os 9y = 2e°”. 
Solution 


We try a solution of the form 


y = pe*, Since the derivative of e3” is 
3e3”, the exponent (32) 


where p is a coefficient to be determined so that the differential equation is appearing in y(x) should be 


satisfied. Differentiating y = pe®” gives the same as that appearing in 
a 2 f(a), and only the coefficient 
oe es 3pe*", ee 9pe>”, p is to be determined. 
dx dx? 
Substituting these into the left-hand side of the differential equation gives 
Qy 


i a 9y = 9pe?® + Ipe** = 18pe%* 


Therefore, for y = pe®* to be a solution of the differential equation, we 
require that 18pe3* = 2e° for all x. Hence p = $, and 


3x 
Yp = ge 


is a particular integral for the given differential equation. Hl 
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*Exercise 2.5 


Find a particular integral for 


d? d 
A tl +y=2e”. 


gt 7 
dx? dx 


A sinusoidal function (f(x) = mcosQz 4+ nsinQz) 


Following on from earlier ideas, the trial solution must contain terms like 
those in f(x) and all its derivatives; so, even if f(x) contains only a sine or 
only a cosine, the trial solution y(z) must contain both a sine and a cosine. 
However, the value of the parameter 2 appearing in y(x) should be the same 
as that appearing in f(z). 


Example 2.5 

Find a particular integral for 
d?y dy 
da? + or + 2y = 10sin 2a. 


Solution 
We try a solution of the form 
y = pcos2z + qsin 22, 


where p and q are coefficients to be determined so that the differential equa- 
tion is satisfied. Differentiating y gives 


d d? 
= —2psin 2x” + 2qcos 22, oe —4p cos 2x — 4qsin 22. 
dx dx? 
Substituting these into the left-hand side of the differential equation gives 
a d 
i + 2 + 2y = (—4pcos 2x — 4qsin 22) 


+ 2(—2psin 2x + 2q cos 2x) 
+ 2(pcos 2x + qsin 2x) 
= (—2p + 4g) cos 2x + (—4p — 2g) sin 22. 


Therefore, for y = pcos 2a + qsin 2x to be a solution of the differential equa- 
tion, we require that 


(—2p + 4q) cos 2a + (—4p — 2g) sin 2a = 10sin2z for all z. (2.6) 


To find p and gq, we compare the coefficients of cos and sin on both sides 
of Equation (2.6). Comparing cos terms gives —2p + 4q = 0. Comparing 
sin terms gives —4p — 2q = 10. Solving these simultaneous equations gives 
p=-—2,q=-—1. Hence 


Yp = —2cos 2x — sin 2x 


is a particular integral for the given differential equation. IH 


*Exercise 2.6 
Find a particular integral for 


a? d 
We _ Lh = cos 3t+ sin 3t. 
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This type of function f(x) is 
particularly important in 
many practical applications. 


Comparing coefficients works 
because the cosine and sine 
functions are linearly 
independent: 

if asinra + bcosra = 0 for 
all x, then a= b= 0. 


Section 2 Inhomogeneous differential equations 


The following procedure summarizes the results of this subsection. 


Procedure 2.2. Method of undetermined coefficients 


To find a particular integral for the inhomogeneous linear constant- 
coefficient second-order differential equation 
2 
at + po +cy = f(z), 
use a trial solution y(x) that has a form similar to that of f(x). For 
simple forms of f(x), the following table gives the appropriate form of 
trial solution. 


f(x) Trial solution y(a) 
Myt” + mye Hee tmx timo pre” + pn—12" +--+ p12 + po 
meke pe 


mcos Qe + nsin Qa pcos Qa + gsin Qa 


To determine the coefficient(s) in y(x), differentiate y(a) twice, sub- 
stitute into the left-hand side of the differential equation, and equate 
coefficients of corresponding terms. 


Exercise 2.7 


What form of trial solution y should you use in order to find a particular 
integral for each of the following differential equations? 


dy o 
(a) co a y= e° 


*Exercise 2.8 


Find the general solution of each of the following differential equations. 


2 d? 0 
i) dy = _ a = 9cos 3t 


Exercise 2.9 


A long horizontal rectangular beam of length / rests on rigid supports at 
each end. It is important in civil engineering to determine how much such a 
beam ‘sags’. A simple model of this ‘sag’, or vertical displacement y, is the 
differential equation 


Ry" — Sy+ 5Q(L-x)z =0, 


where R, S and @ are constants related to the structure of the beam, and 
x is the distance from one end of the beam (as illustrated in Figure 2.1). 
Find the general solution of the differential equation in the case where R, S 
and Q are all equal to 1. 


In Subsection 2.4 you will see how the principle of superposition can be 
used in combination with Procedure 2.2 to solve differential equations whose 
right-hand-side functions f(x) are sums of polynomial, exponential and 
sinusoidal functions. But first let us look at some exceptional cases for 
which Procedure 2.2 does not work and needs to be adapted. 


Note that there are 
exceptional cases where these 
trial solutions do not work. 
See Subsection 2.3. 


The complementary functions 
were obtained in Exercise 1.9 
parts (a) and (d). 


— 


x displacement 


Figure 2.1 
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2.3 Exceptional cases 


There are some exceptional cases for which Procedure 2.2 fails. The aim of 
this subsection is to indicate when such difficulties arise, and how a partic- 
ular integral may be found in those circumstances. Let us begin with an 
example. 


Example 2.6 

Find a particular integral for 
d’y 2 

Solution 


Using Procedure 2.2, let us try y = pe?”. Differentiating this gives 


dy on ay 
—* = 2pe"*, —= = Ape**. 
dx Pe Oy? - 
Substituting these into the left-hand side of the differential equation gives 
dy 


i Ay = Ape** — Ape? = 0. 


So there is no value of p that gives a particular integral of the form y = pe?*. 


The trouble is that the complementary function, i.e. the general solution of 
the associated homogeneous equation 

d?y 

—, —4y =0, 

dx? p 
is y = Ce?” + De?”, thus the trial solution is a solution of the associated 
homogeneous equation (with C = 0, D =p). Hence, on substituting the 
trial solution y = pe?” into the inhomogeneous equation, the left-hand side 
is zero for any value of p, so it cannot be equal to the non-zero right-hand 
side. 


In such circumstances, the difficulty can generally be overcome by multiply- 
ing the trial solution suggested in Procedure 2.2 by x. Thus, in this case, the 
trial solution should be modified to take the form y = pxe?”. Differentiating 
this gives 


d 
oe pe2* + 2pxe?® = p(1 + 2x)e?*, 
dx 
dae = Pe + p(1 + 2x)e™ = 4p(1 + zJe™. 
Substituting these into the left-hand side of the differential equation gives 
d’y dy =4 22 20 __ 2x 
—z — 4y = 4p(1 + x)e" — 4pre** = Ape. 
dx 


Therefore y = pre?” is a solution of the differential equation provided that 
Ape?* = Qe” for all x. Hence p = 7 and 


— ft 22 
Up = 5 ve 


is a particular integral for the given differential equation. Il 
The problem with the trial solution being a solution of the associated ho- 
mogeneous equation can occur irrespective of the form of the trial solution 


(ie. polynomial, exponential or sinusoidal), but in most cases it can be 
overcome by multiplying the trial solution suggested in Procedure 2.2 by x. 
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See Exercise 1.3(c). 


There is an analogy here with 
the case of the homogeneous 
differential equation when the 
characteristic equation has 
equal roots; in that case, 
when e*” is one solution of 
the equation, another solution 
is given by xe>*. 


Section 2 Inhomogeneous differential equations 


When using Procedure 2.2, you should check whether the proposed trial 
solution is a solution of the associated homogeneous equation, and if so try 
multiplying it by z. (This is why it is important to find y, before yp in 
Procedure 2.1.) 


Exercise 2.10 


Find a particular integral for each of the following differential equations. The complementary functions 
d2y dy 2 y dy are given in the solutions to 
(a) Se ae ee Ade (b) 27 +35 = 1 Example 1.3 and 
dx dx dx dx Exercise 1.3(b). 


Exercise 2.11 


The motion of a marble dropped from the Clifton Suspension Bridge into 
the River Avon can be modelled by the differential equation 


mz +rz—mg =), 


where m is the mass of the marble, r is a constant related to air resistance, 
g is the magnitude of the acceleration due to gravity, and x is the vertical 
distance from the point of dropping (as shown in Figure 2.2). Find an 
expression for x in terms of the time t. Figure 2.2 


We have seen that Procedure 2.2 fails if the trial solution is a solution of 
the associated homogeneous differential equation: in such cases we multiply 
the suggested trial solution by x and use this as the trial solution. Another 
situation in which it is necessary to multiply the suggested trial solution by 
x is illustrated in the following example. 


Example 2.7 

Find a particular integral for 
d’y dy 
—54+2—=2 2. 
dz* - ax nu 

Solution 


Using Procedure 2.2, let us try y = pjx + po. Differentiating this gives 


dy _ dy 0 
de? Ggt 
Substituting these into the left-hand side of the differential equation gives 
dy dy 
ee = Oy, 
de?’ “de  “P! 


But there is no value of p; that satisfies 2p; = 2” + 2 for all x. 


The problem this time is that the complementary function is y = C+ De", See Exercise 1.6(c). 
so the po part of the trial solution is a solution of the associated homogeneous 

equation (with C = pp, D=0). Hence, on substituting the trial solution 

y = pix + po into the inhomogeneous equation, the po part disappears, and 

the trial solution effectively reduces to y = p,x. The result in this case is 

that, after substituting the trial solution and its derivatives into the left- 

hand side of the equation, there are not enough terms on the left-hand side 

to compare with the terms in the right-hand-side function. 


Again, the difficulty can be overcome by multiplying the trial solution sug- 
gested by Procedure 2.2 by 2, to give y = p1x? + pox. Differentiating this 
gives 


dy d*y 
—— =2n,0+ — > = 2p}. 
7 Pl PO; ] 2 Pl 
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Substituting these into the left-hand side of the differential equation gives 


d*y dy 
5 + 2—— = 2p + 2(2pix + po) = 4pix + (2p1 + 2po). 
dx dx 
Therefore y = pix? + pox is a solution of the differential equation provided 


that 4p1xz + (2p1 + 2po) = 2a + 2 for all x. This gives p; = 5 po = 5 SO 
Yp = an + 2) 
is a particular integral for the given differential equation. Hl 
To summarize, Procedure 2.2 will fail if all or part of the suggested trial 
solution is a solution of the associated homogeneous equation. In such cases, 


a particular integral can usually be found by multiplying the trial solution 
by x. 


However, it may sometimes be necessary to multiply the trial function by x 
more than once, as explained in Procedure 2.3 and Exercise 2.12. 


Procedure 2.3 Exceptional cases 


Suppose that you try using the method of undetermined coefficients 
(described in Procedure 2.2) for finding a particular integral for an in- 
homogeneous linear constant-coefficient second-order differential equa- 
tion. 


If this fails because all or part of the trial solution is a solution of 
the associated homogeneous equation, then try multiplying the trial 
solution by the independent variable zx. 


If all or part of the resulting trial solution is still a solution of the 
associated homogeneous equation, then try multiplying by x again. 


Exercise 2.12 
Find a particular integral for 


dy dy 
ey ae 
dx? dx a 


2.4 Combining cases 


You have seen how to find a particular integral when the right-hand-side 
function f(a) is polynomial, exponential or sinusoidal. In this subsection 
you will see how to find a particular integral when f(x) is a combination of 
such functions, by using the principle of superposition. 


Example 2.8 
Find a particular integral for 
2 
TY soy = 268 + 1802 +18. (2.7) 
dx? 
Solution 
In Example 2.4 (page 129) you saw that yp, = ses" is a particular integral 
for 
dy 3 
ca) + 9y = 2e a 
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You found the complementary 
function in Exercise 1.6(d). 


Section 3 Initial conditions and boundary conditions 


and in Example 2.2 (page 126) you saw that yp = 2+ 1 is a particular 
integral for 

d?y 

PP) = Oy =9xr+9. 
Therefore, by the principle of superposition (Theorem 1.1), a particular 
integral for Equation (2.7) is 


Yp = ge +2x (e+ 1)= fe" +2c+2. EO 
The approach of Example 2.8 is to find particular integrals for differential 


equations involving each part of f(x) separately, and then to use the prin- 
ciple of superposition to combine the two. 


Exercise 2.13 


Find particular integrals for each of the following differential equations. 


A d 

(a) “4 = 2 + y = 4e* — 362% See Exercise 2.12. 
d? d 

(b) 2 +35 + 2x = 12cos2t+10 


End-of-section Exercise 


*Exercise 2.14 


Find the general solution of each of the following differential equations. You will find some help for 
a0 parts (a), (d), (e) and (f) in 
(a) —, +46 = 2t (b) u’(t) + 4u’(t) + 5u(t) =5 Exercises 1.3 and 1.6, and 
dt Example 1.3. 
@yY ay a) d?y yy 
d?y . dy dy On 
(e) qa + 4Y = sin 2a + 3a (f) dat dg t 2Y = 20" — Be™ 


3 Initial conditions and boundary 
conditions 


In Section 2 you saw how to find the general solution of an inhomogeneous 
linear constant-coefficient second-order differential equation as a combina- 
tion of a complementary function and a particular integral. In practice, 
however, we usually want a particular solution that satisfies certain addi- 
tional conditions. Recall that a particular solution is one that does not 
involve arbitrary constants. In Unit 2 you saw how one additional condi- 
tion (called an initial condition) was needed to find a value for the single 
arbitrary constant in the general solution of a first-order differential equa- 
tion in order to obtain a particular solution. In the case of second-order 
differential equations, in order to obtain a particular solution two additional 
conditions are needed to obtain values for the two arbitrary constants in the 
general solution. 
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There are two types of additional conditions for second-order differential 
equations: initial conditions and boundary conditions. Problems involving 
such conditions are called initial-value problems and boundary-value prob- 
lems, respectively, and are discussed in Subsections 3.1 and 3.2. 


3.1 Initial-value problems 


For a first-order differential equation, an initial condition consists of specify- 
ing the value of the dependent variable (y = yo, say) at a given value of the 
independent variable (x = xo), and is often written in the form y(x%o) = yo. 
One fairly obvious way of specifying two additional conditions for a second- 
order differential equation is to give the values of both the dependent vari- 
able (y = yo) and its derivative (dy/dx = zo) for the same given value of the 
independent variable (x = xo). 


There are many examples of such a pair of initial conditions occurring nat- 
urally as part of a problem. One example is provided by the marble being 
dropped from the Clifton Suspension Bridge, in Exercise 2.11. In that ex- 
ample, x is the vertical distance from the point of dropping. The obvious 
choice of origin for the time t is the time at which the marble is dropped. 
Therefore a naturally occurring pair of initial conditions is that, at time 
t = 0, we know both the position x = 0 and the speed « = 0 (since the mar- 
ble is dropped, i.e. is released with zero initial velocity). Another example 
is provided by the clock pendulum in Exercise 1.7. In this example, when 
the pendulum changes direction, its rate of change of angle @ is momentarily 
zero; also, when it changes direction, it makes its greatest angle 09 with the 
vertical (see Figure 3.1). Therefore, if we measure time ¢ from the moment 
the pendulum changes direction, we have the initial conditions 6 = 09 and 
6 =0 whent = 0. 


Definitions 


(a) Initial conditions associated with a second-order differential equa- 
tion with dependent variable y and independent variable x specify 
that y and dy/dzx take values yo and zo, respectively, when x takes 
the value xo. These conditions can be written as 


d 
Y = Yo and —4 = 2) when x = 2p 
dz 
or as 
y(zo) = yo, y' (#0) = 20. 
The numbers zo, yo and 2g are often referred to as initial values. 


(b) The combination of a second-order differential equation and initial 
conditions is called an initial-value problem. 


Let us now see how initial conditions can be used to determine values for 
the two arbitrary constants and hence find a particular solution. 


Example 3.1 

Find the particular solution of the differential equation 
dy dy 
— —-3— + 2y=0 
dx? dx vee 


that satisfies the initial conditions y = 0 and dy/dx = 1 when x = 0. 
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See Unit 2. 


Figure 3.1 


Section 3 Initial conditions and boundary conditions 


Solution 
From Example 1.3 we know that the general solution is 

y = Ce® + De**, (3.1) 
where C and D are two arbitrary constants. One of the initial conditions 


involves the derivative of the solution, so we need to obtain the derivative 
of the general solution: 


- Ce® + 2De*. (3.2) 
dx 


The initial conditions state that y(0) = 0, y/(0) = 1. Substituting « = 0, 
y = 0 into Equation (3.1) gives 
0=Ce°+De° =C+D, 
while substituting x = 0, dy/dx = 1 into Equation (3.2) gives 
1=Ce°+2De° =C + 2D. 


Solving these equations gives C = —1, D = 1, so the required particular Note that, when checking a 
solution is particular solution, you 
5 should check that it satisfies 
y=-e’+e". Hf the initial or boundary 


conditions as well as the 


5 differential equation. 
*Exercise 3.1 _ 


Find the solutions of the following initial-value problems. 


(a) u(t) +9u(t) =0, u(Z) =0, w'(Z) =1. See Exercise 1.5(b). 
2 
(b) : 3 + 2y = 4e*, y(0) =4, y/(0) =2. See Example 1.3 and 
da“ du Exercise 2.10(a). 
Py dy 
(ce) 53-25 +9 =4e" = 3e?, (0) =4, y’/(0) = -1. See Exercises 1.6(d) 
da“ dx and 2.13(a). 


You saw in Unit 2 that an initial-value problem involving a linear first- 
order differential equation has a unique solution under certain circumstances. 
(Such circumstances hold for nearly every such initial-value problem that you 
are likely to come across in practice.) The same is true of initial-value prob- 
lems involving a linear constant-coefficient second-order differential equa- 
tion, as the following theorem makes clear. 


Theorem 3.1 


The initial-value problem 


a? d 
wea ui i +ey=f(x), — y(@o) = yo,  y'(®0) = 20; 


where a, b, c are real constants with a £ 0, and f(x) is a given contin- 
uous real-valued function on an interval (r,s), with xo € (r,s), has a 
unique solution on that interval. 


Note that one consequence of this theorem is that if the differential equation 
is homogeneous and the initial conditions are of the form y(xo9) = 0 and 
y' (xo) = 0, then the unique solution must be the zero function y = 0, since 
it satisfies the differential equation and the initial conditions. 
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3.2 Boundary-value problems 


The two conditions in an initial-value problem (the value of the dependent 
variable y and its derivative dy/dx) both relate to the same value of z. 
However, the two conditions needed to determine values for the arbitrary 
constants need not relate to the same value of x. We could have one condi- 
tion for « = xg and another for « = x1, say. For example, consider again the 
‘sagging’ beam from Exercise 2.9. Two known conditions on this beam are 
its zero displacements at the ends of the beam, where it rests on the rigid 
supports: that is, its boundary conditions are y(0) = 0 and y(l) = 0 (where | 
is the length of the beam). This pair of boundary conditions gives the value 
of y at two different points, but in general each boundary condition could 
specify the value of either y or dy/dz (or even a relationship between them). 


Definitions 


(a) Boundary conditions associated with a second-order differential 
equation with dependent variable y and independent variable x 
specify that y or dy/dx (or some combination of the two) takes 
values yo and y; at two different values x9 and x1, respectively, of x. 
The numbers 29, 21, yo and y; are often referred to as boundary 
values. 


(b) The combination of a second-order differential equation and bound- 
ary conditions is called a boundary-value problem. 


The conditions are referred to as ‘boundary’ conditions because, as in the 
beam example, they often relate to conditions at the endpoints xo and x, of 
an interval [29,27] on which we are interested in exploring the differential 
equation. 


Let us now see how boundary conditions can be used to determine values 
for the two arbitrary constants and hence find a particular solution. 


Example 3.2 

Find the particular solution of the differential equation 
dy 
— + 9y=0 
dx? 


that satisfies the boundary conditions y = 0 when x = 0 and dy/dx = 1 
when x = 3. 
Solution 
From Exercise 1.5(b), we know that the general solution is 

y = Ccos3z + Dsin3za, (3.3) 
where C and D are two arbitrary constants. 


One of the boundary conditions involves the derivative of the solution, so 
we need to obtain the derivative of the general solution: 
d 
7 = —3C sin 3x + 3D cos 32. (3.4) 
xv 
The boundary conditions state that y(0) = 0, y' (3) = 1. Substituting x = 0, 
y = 0 into Equation (3.3) gives 


0=Ccos0+ Dsin0d = C, 
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ie. C =0. Substituting x = §, y/ = 1 and C = 0 into Equation (3.4) gives 
1= 3Dcos7 = —3D. 
Therefore C = 0, D= —$, so the required particular solution is 


y=—qsin(3c). Ml 


Exercise 3.2 
Find the solution of the following boundary-value problem: 
d?y d 


7 3S + 2y = 4e”, HF OS2. gil) —0 See Exercise 3.1(b). 


Exercise 3.3 
Use the differential equation of Exercise 2.9, with R= S = Q = 1, namely 
y —yt 5 (I ej = 0, (3.5) 


to determine the vertical displacement at the centre of a beam of length 
2 metres resting on rigid supports at its ends. 


Unlike the case of initial-value problems, boundary-value problems may not 
have solutions even when the differential equation is linear and constant- 
coefficient with a continuous real-valued right-hand-side function, as the 
following example illustrates. 


Example 3.3 

Try to find a solution of the boundary-value problem 
dy = 
age t y= 9; y(0)=0, y(%)=1. 

Solution 


From Exercise 1.6(a), the general solution is 
y = Ccos2a + Dsin 2z, 
where C and D are two arbitrary constants. 


The boundary conditions state that y(0) = 0, y(Z) = 1. Substituting each 
of these into the general solution in turn gives 

0 = Ccos0+ Dsin0=C, 

1=Ccos7+ Dsina = —C. 


There is no solution for which C' = 0 and C = —1, so there is no solution of 
the differential equation that satisfies the given boundary conditions. Ml 


Fortunately it is rare for a boundary-value problem that models a real-life 
situation to have no solution (and in such cases it is usually possible to 
reformulate the model to overcome the difficulty). 


Not only is it possible for boundary-value problems to have no solution, but 
it is also possible for them to have solutions that are not unique, as the 
following example illustrates. 
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Example 3.4 
Find the solution of the boundary-value problem 
d’y dy 
Tat + ge + Y= 5 YO) = 1, (a) 
Solution 


From Exercise 2.14(b), the general solution is 
y=e 7 (Ccosx + Dsinz) +1, 
where C and D are two arbitrary constants. 


The boundary conditions state that y(0) = 1, y(7) = 1. Substituting each 
of these into the general solution in turn gives 


1 = e*(Ccos0 + Dsin0)+1=C +1, 
=e "(Coser + Dein) + lS Oe £1. 


Both of these equations reduce to C = 0, but D can take any value, so any 
solution of the form 


y = De“ sing +1 


satisfies the differential equation and the boundary conditions. li 


In Example 3.4, there is not a unique solution of the differential equation 
that satisfies the given boundary conditions, but instead there is an infinite 
family of possible solutions. 


Finally, a word of reassurance: most of the boundary-value problems that 
you will come across in this course will have a unique solution. 


End-of-section Exercises 


*Exercise 3.4 


For each of the following problems, identify the conditions as either initial 
conditions or boundary conditions, and find the solution of each problem. 


(a) u(x) +4u(z)=0, u(0)=1, w’(0) =0. 
(b) w(x) + 4u(z)=0, u(0)=0, u(Z) =0. 
(c) w(x) +4u(z)=0, u(Z)=0, w'(Z) =0. 
(d) w(x) +4u(z)=0, u(—m7)=1, u(Z) =2. 
(e) u(x) + 4u(z)=0, w(0)=0, uw (F)=1 


Exercise 3.5 
Find the solution (if any) of each of the following problems. 
(a) u(t) + 4u/(t)+5u(t)=0, u(0)=0, u’(0) =2. 


2 
(b) OP oh oy ay Siena 0a, 
dx? da. dx: 
(c) #+9x =3(1—-mt), 2(0)=4%, #(%) =0. 
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You found the general 
solution of the differential 
equation in Exercise 1.6(a). 


See Exercise 2.14(b). 


See Exercise 1.9(a). 


See Exercise 1.5(b). 


Section 4 


4 The nature of solutions 


This section is intended principally to assist in the understanding of the na- 
ture of oscillatory solutions to problems involving linear constant-coefficient 
second-order differential equations. For the whole of this section we shall 
assume that the differential equation has the form 


d’y dy 
a—, + b—-+cy = f(x), 
da + ag + Y= FC) 

in which a, b and care positive. (This is almost always the case for equations 
arising in mechanics.) 


4.1 Transients 
Consider the equation 
dy ,dy 
a—> + b—+cy=0 with a>0,b>0,c>0. 
dx? dx 
The nature of the general solution depends on the nature of the roots of the 


auxiliary equation a\? + b’\+c=0. More specifically, you saw in Proce- 
dure 1.1 that 


A1, Ag real and distinct = solution y(z) = Ce™* + De", 
A1, Ag real and equal = solution y(z) = (C+ Dz)e™*, 
Ai, A2 complex (=a+i8) = solution y(x) = e%(C cos Bx + Dsin Br), 


where in each case C' and D are arbitrary real constants. 
Since the auxiliary equation has A; and 2 as roots, it may be written as 
(A= A1)(A — A2) = 0, 
or 
rN? — (Ar + Ag)A + AtAg = 0. 
When we divide through the original auxiliary equation by a, we obtain 
b 
a 


Ve a, 
a 


Comparing the coefficients of A in these two versions of the same quadratic 
equation, we find that 


b 
Ai+d2=—— and Aida = =. 


Now, using the fact that a, b and care positive, we can make some interesting 
deductions. First, c/a must be positive so, if they are real, 4; and 2 
have the same sign. Also, since —b/a is negative, the sum \, + A2 must 
be negative. There is only one conclusion to draw from this: if Ay and Ag 
are real, then both are negative. If on the other hand A, and A2g form the 
complex conjugate pair a +i, then their sum is 


Ay + Ag = 2a. 
Now A; + Ag = —b/a being negative implies that a must be negative. 


The upshot is that, in the above list of solutions, all the exponential terms 
have a negative index. Thus for large values of x, the magnitude of all the 
above solutions will become small. This phenomenon represents damping, 
and you will meet it again in Unit 17. 


The graphs of typical complementary functions in the three cases are shown 
in Figure 4.1. 


The nature of solutions 
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1.57 At, Ae complex 


ay 


1.5- At, Ag real and equal 


15+ At, Ag real and distinct 


sv 


-1.5-5 


2 
Figure 4.1 


In such cases, when the complementary function tends to zero, it is known as 
the transient or the transient solution, in that it essentially disappears 
for large enough x. 


If we now turn to the inhomogeneous equation 
Py _,dy 
a—> + b—+cy= f(a 
age t Oge + Y= F2), 
the above discussion shows that when a, b and c are positive, the comple- 
mentary function is transient and will not affect the long-term behaviour of 
the solution. 
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For this reason, a particular integral not involving part of the transient 
solution is known as the steady-state solution. In most cases, for large x, 
because the transient solution then has little effect, the solution settles down 
to a ‘steady state’ given by the contribution from that particular integral. 
Two typical solutions to initial-value problems of this type are sketched at 
the top of Figure 4.2. Here you can see the two examples of a particular 
solution, followed by the contributions made to each by the transient and 
the steady-state solution. 


Section 4 The nature of solutions 


However, it is still possible 
that a particular integral may 
decay at an even faster rate 
than the complementary 
function! 


YA yA 
2 particular solution 24 particular solution 
17 1- 
0 < 0 T Ey 
10 20 30 40 50 10 20 30 40 50 
-|- -l- 
2-4 27 
YA yA 
2- transient solution 2- transient solution 
17 1- 
0 > 0 — 
10 20 30 40 50 10 20 30 40 50 
-1- -l- 
2-4 27 
yA YA 
24 steady-state solution 24 steady-state solution 
1+ 17 
QOS OS OOO ONO * Owe Os 0 2 O82 OS Ome * 
10 20 30 40 50 10 20 30 40 50 
_j- ie 
24 ~)- 
Figure 4.2 
Example 4.1 


Consider the differential equation of Example 2.5 (page 130), 
d?y dy : 

age + a + 2y = 10sin 2z, 

together with the initial conditions y(0) = 

particular solution and the steady-state behaviour of the solution. 

Solution 

The auxiliary equation is 


dM? +244+2=0, 


with roots —1 +7. In Example 2.5 we found a particular integral 


y(x) = —2cos 2x — sin 2a. 


—1 and y’(0) = —2. Find the 


143 


Unit 3 Second-order differential equations 


Therefore the general solution of the differential equation is 
y(x) =e *(Ccosx+ Dsinx) — 2 cos 22x — sin 22. 


Substituting the initial conditions shows that C = D = 1, so the particular 
solution to the initial-value problem is 


y(z) =e *(cosxz +sin x) — 2cos2z — sin 2z. 


This is made up of the transient e~*(cos z + sin x), which quickly dies away, 
and the steady-state solution —2 cos 2x — sin 2”, which determines the long- 
term behaviour. 


We say that the terms —2cos2z and —sin2x dominate the solution for 
large positive values of z. Hi 


In the remainder of this section you are invited to investigate these ideas 
using the computer algebra package. 


4.2 Solving initial-value problems on the computer 


The computer algebra package for the course allows you to solve initial- 
value problems involving linear constant-coefficient second-order differential 
equations in general, and not just when all the coefficients are positive. The 
following activity asks you to solve such problems, and to examine the nature 
and behaviour of the solutions by means of graphs. 


Use your computer to complete the following activity. 


*Activity 4.1 
Solve each of the following initial-value problems. 
(a) = 3 +2y=4e", y(0)=4, y/(0) =2. 
(b) 7a + 3 =sinz, y(0)=0, y/(0)=-1. 
(c) soe + ee +y=2cos2z, y(1)=0, y(1)=1. 
(d) say ake 5 +5y=4sinz, y(0)=1, y’(0)=-2. 
(e) a ee + 4y =3cos3z, y(%)=0, (3%) =5. 


In each case, consider the long-term behaviour of the solution, and try to 
identify which terms will dominate the solution for large positive values of x. 
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Outcomes 


Outcomes 


After studying this unit you should be able to: 


e understand and use the terminology relating to linear constant-coefficient 
second-order differential equations; 

e understand the key role of the principle of superposition in the solution 
of linear constant-coefficient second-order differential equations; 

e obtain the general solution of a homogeneous linear constant-coefficient 
second-order differential equation using the solutions of its auxiliary 
equation; 

e use the method of undetermined coefficients to find a particular integral 
for an inhomogeneous linear constant-coefficient second-order differential 
equation with certain simple forms of right-hand-side function; 

e obtain the general solution of an inhomogeneous linear constant- 
coefficient second-order differential equation by combining its comple- 
mentary function and a particular integral; 

e use the general solution together with a pair of initial or boundary condi- 
tions to obtain, when possible, a particular solution of a linear constant- 
coefficient second-order differential equation; 

e understand the nature of the solutions of linear constant-coefficient 
second-order differential equations with positive coefficients, particularly 
those involving transient and steady-state parts; 

e use the computer algebra package for the course to solve a second-order 
differential equation. 
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Solutions to the exercises 


Section 1 


1.1 (a) Equations (i), (ii), (iii), (vii), (viii) and (ix) 
are linear and constant-coefficient. (Equations (v) 
and (vi) are non-linear; (iv) is linear but not constant- 


coefficient.) 


(b) Of the linear constant-coefficient equations, only 
(iii) and (ix) are homogeneous. 

(c) In Equations (i)—(vi) the (dependent, independent) 
variable pairs are all (y,x). In Equations (vii), (viii) 
and (ix) they are (t,0), (,t) and (a,t), respectively. 


1.2 (a) 7 -5\+6=0 
(b) \7-9=0 
(c) 7 +2A=0 


1.3 (a) The auxiliary equation is \7 +5\+6=0. 
Solving this by factorization as (A + 2)(A+ 3) = 0 gives 
the roots Aj = —2 and 2 = —3. The general solution 
is therefore 

y=Ce ” + De-*. 
(b) The auxiliary equation is 2\? + 3A =0. This can 


be factorized as (2A + 3) = 0, so its roots are A, = 0 
and Ay = -3. The general solution is therefore 


y = Ce? + De~**/? = C+ De 3*/?, 


(c) The auxiliary equation is \” — 4 = 0, i.e. \* = 4, so 
its roots are Ay = —2 and Az = 2. The general solution 
is therefore 


z= Ce * + De. 


1.4 (a) The auxiliary equation is \7 +2\+1=0, 
which can be factorized as (A+ 1)? = 0, giving equal 
roots Ay = Ag = —1. The general solution is therefore 
y=(C+Da)e*. 
(b) The auxiliary equation factorizes as (\ — 2)? = 0, 
which has equal roots A, = Az = 2. The general solu- 
tion is therefore 
s=(C+Dte™. 


1.5 (a) The auxiliary equation is 7 +4\+8 =0, 
which has solutions 
—42+/16-—32 _ 
5 = 
The general solution is therefore 


y =e **(Ccos2z + Dsin2z). 


A= 242%. 


(b) The auxiliary equation is \? + 9 = 0, which has 
solutions 
= +32. 
The general solution is therefore 
6 = e°(Ccos 3t + Dsin 3t) = Ccos3t + Dsin3t. 
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(Note how simple the solution is when there is no first- 
derivative term in the differential equation. In general, 
the solution of an equation of the form y” + w?y = 0, 
where w is a constant and z is the independent variable, 
is y= Ccoswa + Dsinwz.) 


1.6 (a) The auxiliary equation is \* + 4 = 0, which 
has solutions \ = +2i. The general solution is there- 
fore 

y = Cocos 2a + Dsin 2a. 
(You could also have written down this general solution 
directly using the remark in Solution 1.5(b).) 


(b) The auxiliary equation is \7 — 6\+ 8 = 0, which 
has solutions A; = 4 and Ag = 2. The general solution 
is therefore 


u = Ce** + De**. 
(c) The auxiliary equation is \? + 2\ = 0, which has 


solutions Ay = 0 and Az = —2. The general solution is 
therefore 
y=C+De-*. 


(d) The auxiliary equation is \” — 2+ 1 = 0, which 
has solutions A; = A9 = 1. The general solution is 
therefore 

y =(C+4+ Da)e”. 
(e) The auxiliary equation is A? — w* = 0, which has 
solutions \ = tw. The general solution is therefore 

y = Ce** + De”. 
(f) The auxiliary equation is \? + 44 + 29 = 0, which 
has solutions A = —2 + 5i. The general solution is there- 
fore 

e **(Ccos5a + Dsin5z). 


1.7 The auxiliary equation is \? + g/l = 0, which has 
solutions \ = +i\/g/l. The general solution is therefore 


0 = Cex (/%1) + Dsin( 20), 


(This is another example of an equation involving no 
first-derivative term. So you could have written down 
the general solution directly using the remark in Solu- 
tion 1.5(b).) 


1.8 (a) The required auxiliary equation is 
3-1-2)? =0, 

or, equivalently, 
247 —3’4+1=0. 


(b) The two solutions of the auxiliary equation are 
Ai = $ and Ay =1. 


(c) By Procedure 1.1, the general solution is 
1 
y = Ce2” + De®, 
where C' and D are arbitrary constants. 


1.9 (a) \7+2\+2=0 has solutions —1+ i, so the 
general solution is y = e~*(C'cosa + Dsin a). 


(b) \? — 16 = 0 has solutions +4, so the general solu- 
tion is y= Ce*” + De~*. 

(c) A? —4\+4=0 has solutions A; = Az = 2, so the 
general solution is y = (C + Dz)e?*. 


(d) \?+3A =0 has solutions \; = 0 and Ag = —3, so 
the general solution is @ = C + De~*". 


1.10 The auxiliary equation \” + 4k\ + 4 = 0 can be 
solved using the formula method to give \ = —2k + 
2Vk? —1. So there are complex conjugate solutions, 
leading to a general solution involving sines and cosines, 
when k? < 1, ie. when |k| < 1. 


Section 2 


2.1 We could check this directly, by substituting 
Y= Yp, — Yp. imto the associated homogeneous equa- 
tion. However, it is easier to appeal to the principle 
of superposition. Since yp, and yp, both satisfy 
Py _, dy 
ara + o +cy = f(a), 
Theorem 1.1 shows that the combination y = yp, — yp 
indeed satisfies 
dy |, dy 
ava tba teu = f(z) — f(z) = 0, 
as required. 


2 


2.2 (a) The associated homogeneous equation is 
d?y/dx? +4y =0. The complementary function (see 
Exercise 1.6(a)) is ye = Ccos2x + Dsin 2x. 
Trying a solution of the form y, = p, where p is a con- 
stant, in the original equation d?y/dx? + 4y = 8 gives 
0+ 4p = 8, so that p= 2. Thus a particular integral is 
Up = 2. 
By Procedure 2.1, the general solution is 

y = Ccos2x + Dsin2x + 2. 
(b) The associated homogeneous equation is 
d?y/dx? — 3dy/dx + 2y = 0. The complementary func- 
tion (see Example 1.3) is y. = Ce* + De?*. 
Trying a solution of the form yp) = p in the original equa- 
tion d?y/dx? — 3dy/dx + 2y = 6 gives 0 — 0+ 2p = 6, so 
that p = 3. Thus a particular integral is y) = 3. 
By Procedure 2.1, the general solution is 


y = Ce* + De +3. 


2.3 (a) Substituting y = pix + po and its derivatives 
into the differential equation gives 


O— 2p, + 2(p1e + pp) = 2piz + (2p9 — 2p) = 2x + 3. 


Equating coefficients gives p; = 1, pop = 3. Therefore a 
particular integral is 


Yp = 2+ 3. 


Solutions to the exercises 


(b) Substituting y = pix + po and its derivatives into 
the differential equation gives 


O+ 2p + (pit + po) = pix + (2p1 + po) = 2a. 
Hence p; = 2, po = —4, and a particular integral is 


Yp = 2a —A4. 


2.4 We try y = pot? + pit + po, which has derivatives 
y = 2pot+ pi, y” = 2p2. Substituting these into the 
differential equation gives 


pit + (2p2 — po) 


2p2 — (pet? + pit + po) = —pat? 


=i, 
Hence po = —1, p, = 0, pop = —2, and a particular inte- 
gral is 
Yp = ai 9, 


2.5 We try a solution of the form y = pe~*, which has 
derivatives dy/dx = —pe~*, d?y/dx? = pe~*. Substi- 
tuting these into the differential equation gives 

2pe—” + 2pe~* + pe 


Hence p= 2, and a particular integral is 


* = 5pe * = 2e”. 


2.6 We try y = pcos 3t + qsin 3t, which has derivatives 


d 

7 = —3psin 3t + 3qcos 3t, 
d?y ; 
ae —9p cos 3t — 9qsin 3t. 


Substituting into the differential equation gives 
(—9p cos 3t — Iqsin 3t) — (—3psin 3t + 3q cos 3t) 
= —(9p + 3q) cos 3t + (3p — 9q) sin 3t 


= cos 3t + sin 3t. 
Hence we have a pair of simultaneous equations 
—9p — 3q = 1, 
3p —9q=1. 
Adding three times the second equation to the first 
shows that g = -<, whence p= -<. A particular 
integral is thus 
Yp = —7s Cos 3t — 4 sin 3t. 


27 (a) Try y= pe. 
(b) Try y = pcos3a + qsin3z. 


2.8 (a) From Exercise 1.9(a), the complementary 
function is y. = e~*(Ccosx+ Dsinz). To find a par- 
ticular integral, try y = po. Substituting into the differ- 
ential equation gives 

0+0+ 2p9 = 2p0 =A, 
Hence po = 2, and a particular integral is yp = 2. 
Therefore the general solution is 


y=e *(Ccosx+ Dsinz) +2. 


147 


Unit 3 Second-order differential equations 


(b) From Exercise 1.9(d), 6. = C + De~*'. To find a 
particular integral, try 6 = pcos3t+ qsin3t. Differen- 
tiating gives 

dé 


a —3psin 3t + 3qcos 3t, 
a? ; 
qe = —9pcos 3t — 9qsin 3t. 


Substituting into the differential equation gives 
(—9p cos 3t — 9qsin 3t) + 3(—3psin 3t + 3q cos 3t) 
= (9q — 9p) cos 3t — (9g + Yp) sin 3t 
= 9cos 3t. 
This gives a pair of simultaneous equations to solve: 
—9p+9q = 9, 


—9p —9q = 0. 
Hence p = -t, q= $, and a particular integral is 
a, = —4 cos 3t + 3 sin 3t. Therefore the general solu- 
tion is 


0=C + De — 5 cos 3t + $ sin 3t. 


2.9 Putting the equation into standard form and using 
R=S=Q=1 gives 

yf! —y=—5l 
The associated homogeneous equation is y/’ — y = 0, 
which has auxiliary equation \” — 1 = 0. This has roots 
A = +1, so the complementary function is 

Ye = Ce* + De®. 
To obtain a particular integral, we try a function of 
the form y = pox? + pix + po. Its derivatives are y! = 
Qp2x+ pi, y” = 2p2. Substituting into the differential 
equation gives 


Qpe — (pex” + pix + po) = —p22” 


_ 1 1,2 
gaa—ple+s52°. 


pit + (2p2 — po) 


=e 
Hence p2 = —$, pi = 51, po = —1, and a particular in- 
tegral is 
Yp = $a" + glx -1. 


Therefore the general solution is 


y = Ce* + De® $x + Sla — 1. 


2.10 (a) From Example 1.3, the associated homoge- 
neous equation has general solution y = Ce” + De?*, 
and the trial solution y = pe” suggested by Proce- 
dure 2.2 is a solution of this equation (with C = p, 
D=0). So we try y = pxe® instead. Differentiating 
twice gives 

a pe” + pre* = p(1+az)e", 

dx 

2 

“4 = pe” + p(1+a)e” = p(24+2)e”. 
Substituting into the left-hand side of the differential 
equation gives 


x 


p(2+ ax)e” — 3p(1 + x)e” + 2pre” = —pe* = 4e”. 
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Hence p = —4, and a particular integral is 
Yp = —4ae”. 


(b) From Exercise 1.3(b), the associated homogeneous 
equation has general solution y = C + De~#*/?, and the 
trial solution y = po suggested by Procedure 2.2 is a so- 
lution of this equation (with C = po, D = 0). So we try 
y = pox. Differentiating twice gives 

dy dy 

dx 7° dy 
Substituting into the left-hand side of the differential 
equation gives 3p9 = 1, so pp = §, and a particular in- 
tegral is 


2.11 The associated homogeneous equation is 

m\* +rrA=0, 
with solutions A = 0 and \ = —r/m. The complemen- 
tary function is therefore 

Ge =C+De/™, 
The inhomogeneous term is mg, so Procedure 2.2 sug- 
gests a trial solution x = po. However, this is a solution 
of the associated homogeneous equation (with C = po, 
D=0). Hence we try x = pot instead. Differentiating 
and substituting gives 


rPo = mg, 
so 
mg 
PO. = 
7 
Hence a particular integral is 
mgt 
Lp = — 
Pp e 


and the general solution is 
e=C+ Dewrt/™m a “. 
if 


2.12 From Exercise 1.6(d), the associated homoge- 
neous equation has general solution y = (C + Dz)e’, 
so not only is the trial solution y = pe” suggested by 
Procedure 2.2 a solution of the associated homogeneous 
differential equation (with C = p, D=0), but so is 
y = pre® (with C=0, D=p). So we try y = pre”. 
Differentiating twice gives 

“ = 2pre® + px?e* = p(2a + x )Je”, 

d?y 
dx? 


= p(2 + 2x)e* + p(2x + x? )e* 

= p(2+ 4a + 27 )e*. 

Substituting into the differential equation gives 
p(2 + 4a + 27)e* 


Qp(2x + ax”)e* + pare” = 2pe* 


=e". 


and a particular integral is 


Hence p = s, 
Ae 


2 
Yp = gv 


2.13 (a) From Exercise 2.12, yp = $27e” is a partic- 
ular integral for 
Py dy 


So, using the principle of superposition, we can find a 
particular integral for the given differential equation if 
we can find one for 


dy dy 
2— + y = —3e”. 
dx dx = 7 
We try y = pe?”, which has derivatives 
dy d?y 
hae 9) 22 cei ee 4 2a 
dx a = 


Substituting into the differential equation gives 

Ape*® — Ape” + pe** = pe** = —3e”*. 
Hence p = —3, and yp = —3e”” is a particular integral 
for the differential equation with right-hand side —3e?*. 
Thus, using the principle of superposition, a particular 
integral for the given differential equation is 

Yp = 4(5a7e”) — 3e°* = 2a*e* — 3c”. 
(b) This time we do not have a particular integral for 
any part of the right-hand-side function, so we need to 
start from scratch. 
First consider the 12 cos 2t term on the right-hand side, 
and try x = pcos 2t+ qsin 2¢ as a trial solution. This 
has derivatives 


d 
a = —2psin 2t + 2qcos 2t, 
d2 
aa = —4p cos 2t — 4qsin 2t. 


Substituting into the differential equation gives 
2(—4p cos 2t — 4qsin 2t) + 3(—2psin 2t + 2¢ cos 2t) 


+ 2(pcos 2t + qsin 2t) 
= 6(q — p) cos 2t — 6(p + q) sin 2t 
= 12 cos 2t. 


So p+q=0, q—p= 2, hence p= —1, q=1, anda 
particular integral is 

Lp = — cos 2t + sin 2¢. 
Now consider the 10 term, and try x = po. Substituting 
into the differential equation gives 2p9 = 10, so po = 5, 
and a particular integral is 

Lp = 9. 
Therefore, using the principle of superposition, a 
particular integral for the differential equation with 
f(t) = 12 cos 2t + 10 is 


Lp = — Cos 2t + sin 2t + 5. 


2.14 (a) From Exercise 1.6(a), the complementary 
function is 

6. = Ccos 2t + Dsin 2t. 
To find a particular integral, try 0 = pjt+ po. Substi- 
tuting this and its derivatives into the differential equa- 
tion gives 

A(pit + po) = 2t. 


Solutions to the exercises 


Hence p; = $, Po = 0, and a particular integral is 


Therefore the general solution is 

6 = Ccos 2t + Dsin 2t + st. 
(b) The auxiliary equation is \* + 4\+5 =0, which 
has solutions \ = —2 +7. So the complementary func- 
tion is 

Ue = aa (81 cost + Dsint). 
To find a particular integral, try u = po. Substituting 


gives 5p) = 5. Hence po = 1, and a particular integral 
is 


Up = 1. 
Therefore the general solution is 
u=e “(Ccost+ Dsint) +1. 


(c) The auxiliary equation is 3A? — 2\ — 1 = 0, which 
has solutions Ay = 1 and Ag = —}. So the complemen- 
tary function is 

Y, = Ce® + De-*/8. 
Consider first the e?” term on the right-hand side 
of the equation. To find a particular integral, try 
Y = pe**. The derivatives are dY/dx = 2pe?* and 
d?Y /dx? = 4pe?*. Substituting gives 

3(4pe"") — 2(2pe**) — pe?” = Tpe** = e**. 
Hence p= rT and a particular integral is 

a ze, 
Now consider the 3 term on the right-hand side of the 
equation, and try Y = po. Substituting gives —pp = 3, 
so po = —3, and a particular integral is 

Yp = —3. 
Therefore, using the principle of superposition, a 
particular integral for the differential equation with 
f(x) = e?? +3 is 

Y> = ze" —3. 
Therefore the general solution is 

Y=Ce"+De"" + ze>* — 3. 


(d) From Exercise 1.3(c), the complementary function 
is 

Ye = Ce *™* + De*™. 
To find a particular integral, since e~*” is a so- 
lution of the associated homogeneous equation, try 
y = pre”. The derivatives are dy/dx = p(1 — 2x)e~?* 
and d?y/dx? = 4p(a — 1)e~?”. Substituting gives 

Ap(a — 1)e~*” — 4pre~?* = —4pe** = e~*, 
Hence p= 


20 


-i, and a particular integral is 
Yp = —pue >", 
Therefore the general solution is 


y = Ce—** + De** — que. 
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(e) From Exercise 1.6(a), the complementary function 
is 
Ye = C cos 2a + Dsin 2x. 


To find a particular integral, we note that, from 
part (a), a particular integral for d?y/dax? + 4y = 2z is 
Yp = 4a. So we need to consider only the sin 2x term, 
and then use the principle of superposition. For this 
term, noting the form of the complementary function, 
try y = «(pcos 2x + qsin 2a). The derivatives are 


d 

7 = (p + 2qx) cos 2x + (q — 2px) sin 22, 

d?y : 

aa (4q — 4px) cos 2a — (4p + 4qzx) sin 22. 
x 


Substituting gives 
(4q — 4px) cos 2a — (4p + 4qx) sin 2x 


+ 4x(pcos 2x + qsin 2x) 
= 4qcos 2x — 4psin 2x 
=sin2z. 
Hence p = -, q= 0, and a particular integral is 
Yp = —Fxc0s 2x. 


So, using the principle of superposition, a particular in- 
tegral for the given differential equation is 

Yp = —facos 2x + 3(Sx) = —Facos2x + 32. 
Therefore the general solution is 


y = Ccos 2a + Dsin 2a — +a cos 2a + x. 


(f) From Example 1.3, the complementary function is 
Yo = Ce” + De*™. 

Consider first the 2e* term on the right-hand side of the 

equation. To find a particular integral, since e* appears 

in the complementary function, try y = pxe”, which has 

derivatives 


2 
4 = p(l4+a)e”, 4 = p(24+ x)e*. 
Substituting into the differential equation gives 
p(2 + x)e” — 3p(1 + x)e” + 2pre” = —pe* = 2e”. 
Hence p = —2, and a particular integral is 
Yp = —2re". 


Now consider the —5e?* term on the right-hand side of 
the equation. To find a particular integral, since e?” 
appears in the complementary function, try y = pre", 
which has derivatives 


d 
ae p(1 + 2x)e?*, 
dx 


Substituting into the differential equation gives 


*y 
oe p(4 + 4a)e?*. 


p(4+ 4x)e?” — 3p(1 + 2ar)e?* + 2pxe?* = pe?* 
= —5e*, 
Hence p = —5, and a particular integral is 
Yp = —Sare?*. 


Therefore, using the principle of superposition, a 
particular integral for the differential equation with 
f (x) = 2e* — 5e?* is 

Yp = —2re" — Sze”. 
Therefore the general solution is 


y = Ce® + De?* — 2xe* — 5axe?”. 
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Section 3 


3.1 (a) From Exercise 1.5(b), the general solution is 
u = Ccos3t + Dsin3t. 
Its derivative is 
u’ = —3C sin 3t + 3D cos3t. 
Substituting the initial condition t = 5, u = 0 into the 
general solution gives D = 0. Substituting the initial 
condition t = 3, u’ = 1 into the derivative gives C = 3. 
Hence the required particular solution is 


u = ¥ cos 3t. 


(b) From Example 1.3 and Exercise 2.10(a), the gen- 
eral solution is 

y = Ce® + De* — Aze®. 
Its derivative is 

y’ = Ce* + 2De”* — 4(1+ a)e®. 
Substituting the initial condition z = 0, y = 4 into the 
general solution gives C+ D = 4. Substituting the ini- 
tial condition « = 0, y’ = 2 into the derivative gives 
C+2D—4=2. Hence C = 2, D = 2, and the required 
particular solution is 

y = Qe" + 2c” — Age®. 
(c) From Exercises 1.6(d) and 2.13(a), the general so- 
lution is 

y= (C + Daje* + 207" — 3c”. 
Its derivative is 

y =(C+ D+ Darje* + (4 + 2u*)e* — Ge". 
Substituting the initial condition x = 0, y = 4 into the 
general solution gives C — 3 = 4. Substituting the ini- 
tial condition 2 = 0, y/ = —1 into the derivative gives 
C+ D-6=-1. Hence C=7, D = —2, and the re- 
quired particular solution is 

y = (7 — 2ax)e” + 2a7e* — 3e?* 

= (7 — 22 + 2a” )e* — 3e”*. 


3.2 From Exercise 3.1(b), the general solution is 

y = Ce® + De* — Aze®, 
and its derivative is 

y’ = Ce” +2De** — 4(1+ a)e*. 
Substituting the boundary condition x = 0, y’ = 2 into 
the derivative gives C+2D=6. Substituting « = 1, 
y = O into the general solution gives Ce + De? — 4e = 0, 
which can be rearranged to give C+eD=4. Hence 
C = (8 — 6e)/(2 —e), D = 2/(2—e), and the required 
particular solution is 

8 — 6e 2 og 
= e 


= % Are”. 
Yy De a re 


3.3 From Exercise 2.9, the general solution of Equa- 
tion (3.5) is y = Ce” + De~® — $a? + Slax —1, which 
for | = 2 becomes 

y = Ce™ + De® — $2? + 2-1. 
The boundary conditions, resulting from the beam rest- 
ing on supports at its two ends, are y(0) = 0, y(2) = 0. 


Substitution of these into the differential equation gives 
C+ D-—1=0 and Ce? + De? -1=0. Multiply- 
ing the second equation by e? gives C+ D=1 and 
Ce* + D = e? as the equations to solve. Subtracting 
the equations gives C(e4 — 1) = e? — 1. This gives 


_@-1 e? —1 __4 
~ ef—-1 (e2 + 1)(e2-1) e241’ 
1 e?+1-1 e 
D=1-C=1 = = : 
e2+1 e2+1 e? +1 


Hence the required particular solution is 
— ere) — te? +a = 1, 
y e2 at i ) 2 


At the centre of the beam, x = 1, so y~ 0.148. The 
displacement or ‘sag’ at the centre of the beam is ap- 
proximately 0.148m or about 14.8 cm. 


3.4 Problems (a) and (c) are initial-value problems; 
problems (b), (d) and (e) are boundary-value problems. 


The differential equation is the same in each case, and 
from Exercise 1.6(a) its general solution is 

u = Ccos2a + Dsin 22. 
The derivative is 

u’ = —2C sin 2x + 2D cos 22. 
(a) The condition u(0) = 1 gives C=1. The condi- 
tion u/(0) = 0 gives D=0. The required solution is 
therefore 

U = Cos 2zu. 
(b) The condition u(0) = 0 gives C=0. The condi- 


tion u(4) = 0 gives C =0 also. D therefore remains 


arbitrary, so there is an infinite number of solutions, of 
the form 

u= Dsin2z. 
(c) The condition u(%) =0 gives C = 0. The condi- 
tion u'(Z) =0 gives D=0. The required solution is 
therefore the zero function 

u=0. 
(Alternatively, since the differential equation is homoge- 
neous and the initial conditions are both equal to zero, 
by the remarks after Theorem 3.1 the solution is the 
zero function u = 0.) 
(d) The condition u(—7) = 1 gives C= 1. The con- 
dition u(7) =2 gives D = 2. The required solution is 
therefore 

u = cos2a + 2sin2z. 


(e) The condition u’(0) = 0 gives D=0. The condi- 


tion u’(4) = 1 gives C = —4. The required solution is 
therefore 
u=—4}cos2a. 


Solutions to the exercises 


3.5 Parts (a) and (b) are initial-value problems, and 
therefore by Theorem 3.1 each has a unique solution. 
However, part (c) is a boundary-value problem, which 
may have no solution, a unique solution, or an infinite 
number of solutions. 


(a) From Exercise 2.14(b), the general solution is 
u=e (Ccost+ Dsint). 

Its derivative is 
u’ =e **((-2C + D) cost — (C + 2D)sin?). 

The condition u(0) =0 gives C=0. The condition 

u'(0) = 2 gives D = 2. The solution is therefore 


u = 2e~7! sint. 


(b) The differential equation is homogeneous and the 
initial conditions are both equal to zero. Hence the so- 
lution is the zero function y = 0. 


(c) From Exercise 1.5(b), the complementary function 
is 
te. = Ccos 3t + Dsin 3t. 


To find a particular integral, try x = pit + po. Substi- 
tuting into the differential equation gives 
W(pit + po) = 3(1 — zt). 
Hence p) = —3, Po = 7 and a particular integral is 
Lp = —Rt+ z 
Therefore the general solution is 
x = Ccos3t + Dsin3t — St + §, 
and its derivative is 
& = —3C'sin 3t + 3D cos 3t — §. 


The condition x(0) = § gives C=0. The condition 
(%) = 0 gives D = —@. The solution is therefore 
a Tes Ty 1 1 
x= — Gsindt — Zt 5. 
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UNIT 4 Vector algebra 


Study guide for Unit 4 


This unit assumes no previous knowledge of vectors. You will need to know 
only basic algebra and trigonometry, and how to use Cartesian coordinates 
for specifying a point in a plane. 


The recommended study pattern is to study one section per study session, 
and to study the sections in the order in which they appear. 
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Introduction 


We often need to represent physical quantities such as mass, force, velocity, 
acceleration, time, etc., mathematically. Most of the physical quantities 
that we need can be classified into two types: scalars and vectors. Scalar 
quantities are quantities, like mass, temperature, energy, volume and time, 
that can be represented by a single real number. Other quantities, like 
force, velocity and acceleration, possess magnitude and direction in space, 
and cannot be represented by a single real number; they are called vector 
quantities. 


The definitive vector quantity is displacement. The displacement of a point 
specifies the position of the point in space relative to some reference point. 
We use the concept of displacement whenever we want to describe spatial 
relationships. Consider, for example, the instructions written in blood on a 
pirate’s treasure map: 


Take five paces due north from the big oak tree, then seven paces due 
west, and then dig down for three metres. 


This is a specification of a displacement vector — the displacement of the 
treasure from a reference point (the big oak tree). In fact, this particular 
way of specifying the displacement of the treasure is known as the Cartesian 
description of a displacement, although the pirate probably didn’t know 
that. Alternatively, there is the so-called polar description of the same 
displacement (equating paces with metres): 


Starting at the big oak tree, dig for 9.1 paces along a straight sloping 
line inclined at 19° below the horizontal at a bearing of 54° west of north. 


This ‘distance (or magnitude) plus direction’ specification will also get you 
to the treasure, although less conveniently because it is more difficult to 
dig along a sloping line. These two different specifications are shown in 
Figure 0.1. 


Figure 0.1 


Section 1 defines a vector and discusses ways of representing vectors in two 
dimensions. Section 3 discusses another way of representing vectors, one 
that easily generalizes from two to three (or more) dimensions. Sections 2 
and 4 consider ways of operating on and combining vectors — that is, they 
provide the fundamentals of vector algebra. 
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Section 1 Describing and representing vectors 


1 Describing and representing vectors 


Subsection 1.1 explains what scalars and vectors are. Subsections 1.2 and 1.3 
then explain how to denote vectors symbolically (i.e. algebraically) and how 
to show them in diagrams. Subsection 1.4 explains what is meant by saying 
that two vectors are equal to one another, which is a necessary first step 
in the development of an algebra for vectors. Subsection 1.5 introduces a 
method for representing vectors in two dimensions that can be useful in a 
variety of physical situations. 


1.1 Scalars and vectors 


A scalar is any quantity, such as mass, time, volume and temperature, that 
can be represented mathematically by a single real number (and often a unit 
of measurement). Real numbers themselves are examples of scalars, and you 
can regard the terms scalar and real number as synonymous. Examples of 
scalar quantities, quoted to some convenient degree of accuracy, are: 


the mass of the Earth, 5.975 x 1024 kilograms; 

the temperature of melting ice, 0 degrees Celsius; 

my current bank account balance, —153.12 pounds sterling; 
pi (7), 3.14159... 


A real number is defined by two properties: its modulus |x| and its sign. The modulus of a real number 
Thus the magnitude of a scalar x is |x|. For example, the magnitude _ is also called its magnitude. 
of my current account balance is |—153.12| pounds = 153.12 pounds, which 

sounds a lot better since it doesn’t remind me that I’m in debt. Note that 

magnitudes are always non-negative (i.e. positive or zero). 


A vector quantity is any quantity, such as force, velocity, displacement, etc., 

that has a magnitude and a direction in space (or, in two dimensions, a 

direction in a plane). An example is the velocity of a motor car travelling 

on the M4 motorway from London to Bristol with a speed of 95km per hour The familiar term speed is 
in a westerly direction. The magnitude of the velocity vector is 95 (dropping used to mean the magnitude 
units for convenience), and the direction of the velocity vector is due west. f @ velocity. Speed is a 
Thus the specification of a vector consists of: tion Hepsve sealer: 

(a) a non-negative real number, called its modulus or magnitude; 


(b) a direction in space. 


This unit is mainly concerned with just two vector quantities: displacement 
and velocity. Later in the course you will come across other vector quantities 
such as force, torque and momentum. Fortunately, all vector quantities obey 
exactly the same laws of algebra. Thus what you learn about displacements 
and velocities in this unit can be carried over to all vector quantities. 


1.2 Vector notation 


Vectors are denoted in printed text by bold letters, e.g. v, F. In your 
written work, you should denote vectors by drawing either a straight line or 
a squiggly line under the letter, e.g. v, F or vy, F. Thus if a symbol is used 
to represent the velocity of an object, then it must be handwritten by you 
as either v or vy, (but will be printed in the text as v). 


It is important that you learn to write vectors using the underlining: if 
you do not do so, someone reading your work may not be able to tell 


that you are referring to a vector. In particular, you may lose marks! 
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The modulus or magnitude of the vector v is denoted by |v| or, sometimes, 
where there is no possibility of ambiguity, by v; |v| is a non-negative scalar. 


1.3 Using arrows to represent vectors 


A vector can be conveniently represented in a diagram by an arrow, i.e. a 
straight line with an arrowhead on it. The tail of the arrow may be placed 
at some fixed origin, the direction of the arrow is chosen to represent that 
of the vector, and its length is chosen to be proportional to the magnitude 
of the vector. In Figure 1.1, which uses the origin of the Cartesian coor- 
dinate system as the fixed origin, the shorter arrow represents a vector of 
magnitude 1 in the positive x-direction, and the longer arrow represents 
a vector of magnitude 2\/2 in a direction at 7 radians (45°) to the posi- 
tive x-direction. (Note that we use the convention that positive angles are 
measured anticlockwise.) If we decide to denote these vectors by letters a 
and b, respectively, then we can also put this information on the diagram, 
by writing a and b near the arrowheads, as shown in Figure 1.2. 


Note that in this course, and commonly elsewhere, the arrows representing 
vectors are drawn using thick lines. This helps to distinguish vector arrows 
from other arrowed lines such as those representing the coordinate axes 
(e.g. Figures 1.1 and 1.2) or those representing compass directions (e.g. 
Figure 1.3). 


*Exercise 1.1 


Represent the following two vectors on a diagram by arrows: 
e vector a has magnitude 3 units and points in the positive y-direction; 


e vector b has magnitude 4 units and points in the direction at 3 radians 
(60°) to the positive x-direction. 


Vector notation and the use of arrows in diagrams is now illustrated further 
by specific reference to displacement vectors and velocity vectors. 


Displacement is the position of a point in space relative to some reference 
point or origin. For example, the city of Leeds is 296km from the city of 
Bristol in the direction of 15° east of north (N15°E). The displacement of 
Leeds from Bristol can be specified as the vector 


s = 296kmN 15°E. 


Here the bold symbol s has been used to denote the displacement. Note that 
both magnitude and direction are specified: the magnitude of the displace- 
ment is |s] = 296 km, and the direction is specified by the compass bearing 
N15°E. 


The displacement s = 296 km N15°E can be represented in a diagram by an 
arrow, as shown in Figure 1.3. The length of the arrow represents 296 km, 
which may be shown in the diagram by writing |s| = 296. 


For any two points P and Q, we can define the displacement vector from 
P to Q: it is the vector whose magnitude is the distance from P to Q and 
whose direction is the direction of the straight line from P to Q. A useful 
notation for this vector is PQ (see Figure 1.4). In this context the symbol 
PQ (without an arrow) represents the length of the straight line joining P 
and Q, i.e. PQ = |PQ|. Note that PQ = QP but PO we OP (because PO 
and OP 


are in opposite directions). 
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We read |v| as ‘the modulus 
of v’ or ‘the magnitude of v’, 
or simply ‘mod v’. 


ala 


Figure 1.1 


Figure 1.2 


It would be wrong to write 
s = 296km, because the 
left-hand side is a vector 
symbol and the right-hand 
side is a scalar. 


Nt s 
|s| = 296 
15° 

O 
0 100 200 300 
| l | J 

Scale (km) 

Figure 1.3 


Section 1 Describing and representing vectors 


Displacement vector 


The displacement vector PO is the vector whose magnitude is the dis- 
tance from P to Q and whose direction is the direction of the straight 
line from P to Q. 


One query may have occurred to you. What is the displacement vector of a 
point from itself? In other words, what is the vector PP? Clearly its length 
is zero, but what is its direction? The answer is that it does not have one! 
We define the zero vector to be the unique vector with magnitude zero 
and no direction. It is denoted by 0. Thus we can conclude that PP =0. 


Zero vector 


The zero vector is the unique vector with magnitude zero and no 
direction. It is denoted by 0. 


A constant velocity is also defined by a magnitude and a direction. For 
instance, in a weather forecast, a typical wind velocity might be 35 knots 
from the north-west. It is not sufficient to say that ‘the wind velocity is 
35 knots’; the obvious question about such a statement would be ‘from 
which direction?’. The vector v representing this velocity has magnitude 35 
and direction from the north-west and towards the south-east (since the 
air is travelling in the south-easterly direction). It can be represented on a 
diagram as shown in Figure 1.5. The length of the arrow represents a wind 
speed of 35 knots. 


Note that the direction of a vector consists of two attributes: 


(a) an orientation, represented by the slope of the arrow in diagrams like 
Figures 1.1 to 1.5; 


(b) a sense, represented by the arrowhead. 


For instance, the arrow representing the velocity 35 knots from the north- 
west in Figure 1.5 is a line making an angle of 45° anticlockwise from the 
south direction (the orientation) and an arrowhead pointing towards south- 
east as opposed to north-west (the sense). 


*Exercise 1.2 


The displacement of Birmingham from Derby is 57km in the direction 
S30°W. The displacement of Leicester from Derby is 32 km in the direction 
5S 45°E. 


Draw a diagram, to a suitable scale, representing these two displacements 
by arrows. 
Exercise 1.3 


A car travelling from London along the M1 with speed 70 mph heads in the 
direction N 60°W near Junction 14. 


Represent the velocity of the car by an arrow, drawn to a suitable scale. 


> 
a Q 
P 
Figure 1.4 


Be particularly careful to 
underline the zero vector 
(0 or 0) in your written work! 


| fl 
Scale (knots) 


Figure 1.5 
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1.4 Equality of vectors 


Definition 


Two vectors are said to be equal if they have the same magnitude and 
the same direction. 


You have seen how to represent a vector by an arrow. This definition of 
equality of vectors tells us that the two features needed to define a vector 
uniquely are its magnitude and direction. This means that any two arrows 
drawn at different places on the page but which are equal in length, parallel 
and have the same sense, can be used to represent the same vector. For 
instance, the two arrows in Figure 1.6 are each of length 2 units and point 
in the positive x-direction. They represent two equal vectors, and we write 
b = d. In other words, the arrow representing a vector does not have to be 
drawn so that its tail is at any particular point. 


Example 1.1 


Figure 1.7 shows several vectors represented by arrows drawn to scale. Find 
the vector equal to the vector a. 


ya 
c 
37 b a 
f 2-e 
d 
17 <—_- 
T T T T T T T T ne 
4 = —2 -10 1 2 3 4 5 6 * 
-1- Se od 
a a 
=9 =| 8 
-3- al 
h 
Figure 1.7 


Solution 

We are looking for a vector that is equal in length to a (i.e. one unit), parallel 
to a and points in the same direction (ie. the positive x-direction). There 
are two arrows (and thus vectors) other than a that point in the positive 
x-direction; they are c and h. (The arrow representing d points in the 
negative x-direction.) The magnitudes of c and h are 1 unit and 3 units, 
respectively. Since the magnitude of a is 1 unit, c=a but ha. 


Note that although a and c are drawn at different places in the (2, y)-plane, 
they are equal in magnitude and have the same direction, so they are equal 
vectors. Hi 


*Exercise 1.4 


Which vector in Figure 1.7 is equal to vector b? 
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1.5 Polar representation of two-dimensional vectors 


This subsection introduces a systematic way of specifying the magnitude 
and direction of a vector in a coordinate system. 


You should be familiar with using a two-dimensional Cartesian coordinate 
system for specifying the position (x, y) of a point in a plane, and indeed the 
same system is commonly used for displaying vectors (as in Figures 1.1, 1.2, 
1.6 and 1.7). The plane polar coordinate system, however, is in some 
sense a more natural one for specifying vectors since it effectively regards 
magnitude and direction as two coordinates. 


Let r be a vector on a plane surface. Introduce a Cartesian coordinate 
system, and draw the vector as an arrow with its tail at the origin O, as 
in Figure 1.8. The magnitude of r is |r| =r, the distance of the tip P of 
the arrow from O. The direction of r is specified by the angle @ measured 
(usually in radians) anticlockwise from the positive x-axis. 


P 


> 
O xX=rcos@ x 
Figure 1.8 


We have not quite finished the description, because a vector now has many 
representations (since rotating the line segment OP through 2nz, where n is 
any integer, leaves it unchanged). To avoid this ambiguity, we shall normally 
take ¢ to lie in the range —7 < @< a. (Note that under this convention a 
vector below the x-axis has a negative value for ¢ — see Figure 1.9.) 


Thus the endpoint P of a vector r is specified by the two numbers r (a dis- 
tance) and ¢ (an angle). These two numbers r and ¢ are the plane polar 
coordinates (or simply polar coordinates) of the endpoint P of the vec- 
tor r, when the tail of its arrow is at O. We use the notation (r, ¢) in order 
to distinguish polar coordinates from the Cartesian variety, so the vector is 
now specified as 


r= (ro). 


You can see from Figure 1.8 that the polar coordinates (r, ¢) of P are related 
to the Cartesian coordinates (x,y) of P by the following formulae: 


x=rcos¢d, y=rsing; 
r=(¢+y’)"?, tand=y/z. 


However, the statement tan ¢ = y/x does not define ¢ uniquely since, for 
example, tang = tan(7+¢). To pin down the value of ¢ in the range 
—1<@< 7, we can use the two equations 


sing=y/r and cosd=2/r. 


In practice, when finding the angle ¢ from the values of x and y, it usually 
helps to sketch the Cartesian coordinates in the (a, y)-plane so that you can 
see in which quadrant @ must lie. The signs of sin and cos for angles in 
the four quadrants are shown in Figure 1.10: you will find it useful to know 
these. (A simple acronym to aid the memory is ‘CAST’: starting from the 
lower right, and working anticlockwise round the quadrants, the following 
are positive: Cos, All (of sin, cos, tan), Sin and Tan.) 


In fact, a vector has an 
infinite number of 


representations! 
y positive @ 
b 
x 
y 
> 
x 
b 
negative @ 
Figure 1.9 
ya 
quadrant 2 | quadrant 1 
S A 
sin > 0 sin > 0 
cos < 0 cos > 0 
it 
sin < 0 sin <0 : 
cos < 0 cos > 0 
T Cc 
quadrant 3 quadrant 4 
Figure 1.10 
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Example 1.2 


Give the polar representation of the vectors a, b, e and g in Figure 1.11. 


Figure 1.11 


Solution 


In Figure 1.11 each vector is drawn as an arrow from the origin, so the 
polar representation of each vector is given by the polar coordinates of its 
endpoint. In some cases we can specify r and ¢ simply from inspection of 
Figure 1.11, but we shall use the above formulae in order to illustrate the 
general method. 


The endpoint of a has Cartesian coordinates (1,—1), so we have 

r= (P+ (-1)!? = v3, 

sing=y/r=—1/V2 and cos¢=2/r=1/Vv2. 
Thus ¢ = —% radians, ie. a = (/2,—4). (Since —3 < —4 <0, the angle 
coordinate —7 indicates that a should lie in the fourth quadrant, which is 
confirmed by Figure 1.11.) 


Vector b is of length 2 units and points in the positive y-direction. The 
Cartesian coordinates of its endpoint are (0,2), so we have 


r= (07 +27)? = 2, 

sng=g/r=2/2=1 and cosp=—az/r=0/2—0, 
Hence @ = § radians (which is obvious from the fact that b points in the 
positive y-direction). Hence b = (2, 5). 
The endpoint of vector e has Cartesian coordinates (—1, 2), so 


r= ((-1)? +27)? = V5, snd=2/V5 and cos¢ =—1/V5, 


giving @ = 2.034 radians. Hence e = (V5, 2.034). (Since 5 < 2.034 < 7, the 
angle coordinate 2.034 indicates that e should lie in the second quadrant, 
which is confirmed by Figure 1.11.) 


Finally, g is of unit length (i.e. of length 1 unit) and points in the positive 
x-direction. The Cartesian coordinates of its endpoint are (1,0), so 


r=(12+407)/%=1, singd=0 and cosd=1. 


Thus g = (1,0). (This is an exceptional case where the numerical values of 
the coordinates are the same in the two coordinate systems.) ll 


*Exercise 1.5 


Complete Table 1.1. Each row should show the Cartesian and corresponding 
polar coordinates of a particular point. If any entry is invalid, say so and 
explain why. 
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Table 1.1 
Cartesian Polar 
coordinates (x,y) | coordinates (r, ) 
| 4) 
| (4,-2) 
| (6,7) 
| (-1,-1) 
| (=1,7) 


(108, exp(0.17)) 


*Exercise 1.6 


As you saw earlier, the displacement of Leeds from Bristol can be expressed 
as s = 296km N 15°E (see Figure 1.3 on page 156). Express this vector in 
polar form (r,@) using a suitable coordinate system. 


The polar representation of vectors can be a useful representation in a variety 
of physical situations, as you will see later in the course. (It is generalized 
to three dimensions in Unit 23.) 


End-of-section Exercises 


Exercise 1.7 


The following is a list of some physical quantities: temperature, velocity, 
volume, energy, force, displacement, time, acceleration. Decide which are 
scalar and which are vector quantities. 


Exercise 1.8 


What are the polar coordinates of a point Q whose Cartesian coordinates 
are (0,—3)? What is the magnitude of the vector OG where O is the origin 
of coordinates? 


2 Scaling and adding vectors 


This section defines two arithmetic or algebraic operations involving vectors. 
The first and simpler of these is the multiplication of a vector by a scalar, 
or scaling of a vector. The second is the addition of two vectors to give a 
third vector called the resultant of the two vectors. 


2.1 Scaling of a vector 


Consider vectors c and h in Figure 1.7 (page 158). Both vectors point in 
the same direction, but h has a length three times that of c. We say that h 
is a scaling of ¢ by the number 3, and we write h = 3c. 
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Generally, if v is a vector and m is a positive number, then the product mv 
is a vector in the same direction as v but with magnitude m|v\, i.e. m times 
the magnitude of v. This multiplication of a vector by a scalar is called 
scaling or scalar multiplication, and mv is called a scalar multiple of v. For 
example, if v has magnitude 4 and points in the positive x-direction, then 
3v has magnitude 12 and points in the positive x-direction also. This is 
illustrated in Figure 2.1. 


Or-——_>4_#@_ Oe. 
v x 3v x 


Figure 2.1 


We can also scale a vector v by a negative number. When m is negative, 
the vector mv has magnitude |m||v| and points in the opposite direction 


to v. A special case is when m = —1. Then the vector (—1)v has the same 
magnitude as v but points in the opposite direction; see Figure 2.2. We 
normally write (—1)v simply as —v, i.e. (—1)v = —v. 


What happens when we multiply a vector by zero (m = 0)? The above 
definitions imply that the result should be a vector with magnitude zero. 
You will recall from Section 1 that there is a special vector with magnitude 
zero, namely the zero vector, 0. Thus Ov = 0. 


Definition 

For any vector v and any real number m, the scalar multiple mv is 
the vector with magnitude |m||v| which is: 

e in the same direction as v if m > 0; 

e in the opposite direction to v if m < 0; 

e the zero vector (i.e. with unspecified direction) if m = 0. 


The multiplication of v by m is called scaling or scalar multiplica- 
tion. 


Example 2.1 


(a) Let u represent the velocity of my car travelling with a speed of 30 mph 
along a straight road due north. Write down, in terms of u, the velocity 
of a car overtaking me and travelling at 45 mph. If another car is trav- 
elling in the opposite direction to me with speed 60 mph, write down 
this car’s velocity in terms of u. 


(b) If ABCDEF is a regular hexagon (Figure 2.3) and, for example, AB 
represents the displacement vector from A to B, write down algebraic 
relations connecting: 


(i) AB and ED; (ii) AF and DC. 


Solution 


(a) The velocity vector u has magnitude 30 mph and points due north. The 
car overtaking me is travelling in the same direction but has a velocity 
of magnitude 45 mph; suppose that its velocity vector is denoted by v 
(see Figure 2.4). Then v is parallel to u and has the same sense as u, 
and |v| = \ul. Therefore v is just a scaling of u, i.e. 


v= 1.5u. 
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Note that there is no 
multiplication sign between 
the m and the v. In vector 
algebra the dot and cross 
symbols are reserved for other 
products, to be discussed in 
Section 4. 


v 
O -V 
Figure 2.2 
A B 
F C 
E D 
Figure 2.3 


Two vectors are parallel if 
they have either the same, or 
opposite, directions. 


Section 2 Scaling and adding vectors 


Now suppose that the velocity of the car travelling in the opposite di- 
rection is denoted by w. Then w is parallel to u but has the opposite N 
A 


sense to u, and |w| = $8|u|. So we can write 
50-7 
w = —2u, v 
40 - 
where the negative sign indicates the opposite sense. 
30-5 u 
(b) The opposite sides of a regular hexagon are parallel, and all the sides 
have the same length. 20- 
(i) Thus the displacement vectors AB and ED have equal magnitudes 10 
and the same direction. So we have e 
AB = ED. 
Figure 2.4 


(ii) The displacement vectors AF and DC have equal magnitudes but 
opposite directions, thus 


AF =—-—DC (or, equivalently, DC = —AF). | 


*Exercise 2.1 


(a) If d is the displacement vector from Bristol to Leeds, write down in 
terms of d the displacement vectors from Leeds to Bristol and from 
Leeds to Leeds. 


(b) If v represents the velocity of a wind of 35 knots from the north-east, 
what vectors represent the following? 


(i) A wind of 70 knots from the north-east. 
(ii) A wind of 35 knots from the south-west. 
(iii) Still air. 
(c) Relate the direction and magnitude of —1.5v to those of v, where v is 
any given non-zero vector. Do the same for —kv, where k is an arbitrary 


positive number. B C 
(d) If ABCD is a parallelogram (Figure 2.5) and, for example, AB repre- Lo 


sents the displacement vector from A to B, write down algebraic rela- 
tions connecting: 


(i) AB and DC; (ii) BC and DA. Figure 2.5 


(e) If v is any non-zero vector, what are the magnitude and direction of the 


A D 


1 
vector —v? 


Iv| 


Unit vectors 


1 
The vector ie in Exercise 2.1(e) is a vector that has magnitude 1 and 
v 


points in the direction of v. It is called the unit vector in the direction of v. 
The unit vector in the direction of v is often denoted by the symbol v. 


Definition 


For any non-zero vector v, the unit vector in the direction of v is the 
vector 

ee 1 

vV=—Vv 
lv| 
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Unit vectors are often used to denote directions in the plane, or in space. 


A particular example is provided by the unit vectors in the positive directions 
of the x- and y-axes in the plane Cartesian coordinate system. These unit 
vectors are denoted by i and j, respectively, and are called Cartesian unit 
vectors. 


Figure 2.6 shows these Cartesian unit vectors and two other vectors, a and b. 
The vector a has magnitude 2 and points in the positive x-direction; b has 
magnitude 3.5 and points in the positive y-direction. The unit vector i has 
magnitude 1 and points in the same direction as a. Thus we can write a in 
terms of i by a scaling: 

a= 2i. 
Similarly, we can write b in terms of j: 


b =3.5j. 


Any vector parallel to the x- or y-axis can be written as a scaling of i or j. 


Exercise 2.2 


Four vectors, a, b, c and d, of magnitudes 2, 2.5, 3 and 1, respectively, are 
shown in Figure 2.7. The directions of the four vectors are defined by the 
arrows. Write down a, b, c and d as scalings of the Cartesian unit vectors 
i and j. 


Ya 
37 - qe 
a 
24 ———— 
| | 
b 
——_ 1- | | 
| I | | 
++ T T [> 
4 3 2 -1 1 2 3 4 * 
he 
dV _5 
i 
35 
Figure 2.7 


Exercise 2.3 


Let the unit vectors i and j denote the directions of east and north, respec- 
tively. Specify the following vectors as scalings of i and j. 


(a) The wind velocity of 35 km per hour due south. 
(b) The displacement of Bristol from London (112 miles due west). 


(c) The displacement of London from Bristol. 
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We shall develop the 
Cartesian representation of 
vectors in Section 3. 


Figure 2.6 


Note that although i and j are 
shown in Figure 2.6 with their 
tails at the origin, this is not 
necessary. They can be drawn 
at any convenient position, 
provided only that they are of 
unit magnitude and point in 
the positive z- and 
y-directions, respectively — 
see, for example, Figure 2.7. 


Section 2 Scaling and adding vectors 


2.2 Addition of vectors 


What is meant by the addition of vectors? Suppose that we make a journey 
from Bristol to Leeds, and then another journey from Leeds to Norwich. The 
first journey produces a displacement of d; and the second a displacement 
of dg. The net result of the two journeys is a displacement of ds from Bristol 
to Norwich. This is illustrated by the triangle of displacements shown in 
Figure 2.8. Displacements are said to add by the triangle rule, and we write 
d3 = d,; + dg. The vector dg is called the resultant of d, and dg. 


Velocities also add by the triangle rule, and so do forces, accelerations and 
all other vector quantities. Thus the triangle rule is also called the vector 
addition rule. 


Triangle rule or vector addition rule 


To add any two vectors a and b: choose an origin O; draw the line 
OP in the direction of a and with length equal to the magnitude of a; 
and draw the line PQ in the direction of b and with length equal to 
the magnitude of b (as in Figure 2.9). Then a+b is the vector with 
magnitude equal to the length of OQ and with the direction from O 
to Q. The vector a+ b is called the sum or resultant of a and b. 


Note that the sum of two displacement vectors can also be written using the 
notation 


OP + PG = 00. 


Now recall that when discussing displacements we mentioned the zero vector 
0 (representing no displacement). Once addition of vectors is introduced, we 
need the zero vector in order to answer questions such as ‘what is i+ (—1)i?’. 
Geometrically, no construction is needed when adding the zero vector, which 
obeys the rather obvious rule 


a+O=a. 


Exercise 2.4 
Three vectors a, b and c of magnitudes 3, 2 and 4 are shown in Figure 2.10. 
(a) Draw a rough sketch to show the vectors a+b anda+ce. 


(b) Sketch the vector —b, and draw a rough sketch to show the addition of 
a and —b. 


Exercise 2.4(b) suggests a definition of vector subtraction. To subtract 
the vector b from the vector a, we add the vectors a and —b by the triangle 
rule of vector addition; that is, in symbols, 


a—b=a+(-b). 


*Exercise 2.5 


A vector a has magnitude 3 units and points in the positive x-direction. A 
vector b has magnitude 4 units and points in the positive y-direction. Draw 
a diagram showing the vectors a+ b and a— b. 


d, Leeds 
d, 
Norwich 
d, 
Bristol 
Figure 2.8 
a+byO 
b 
O P 
a 
Figure 2.9 


Figure 2.10 
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Vector addition is commutative, i.e. the order in which we add two vectors 
does not matter. This can be illustrated by reference to vectors a and c of 
Exercise 2.4 (see Figure 2.11). The triangle OP,Q illustrates the addition 
a+c, while triangle OP2Q illustrates c+ a. The same resultant eye) is 
obtained in both cases. Thus 


a+c=c+ta. 


Exercise 2.6 


For the particular cases of the vectors a, b, ec defined in Exercise 2.4, and 
for the scalar m = 2, draw sketches to illustrate the associative property of 
vector addition, 


(a+b)+c=a+(b-+c), 
and the distributive property of scaling over vector addition, 


m(a+b) =ma+ mb. 


An alternative geometrical construction for adding two vectors can be seen 
from Figure 2.11. It is called the parallelogram rule. Draw the two 
vectors OP, and OP» with the same beginning point O. Complete the 
parallelogram OP,QP 2. Then the resultant vector is the vector OO on 
the diagonal of the parallelogram. The parallelogram rule gives the same 
resultant as the triangle rule. 


2.3 Algebraic rules for scaling and adding vectors 


Subsections 2.1 and 2.2 showed how to multiply a vector by a scalar and 
how to add vectors, i.e. what is meant by mv and a+b. We also saw 
illustrations of the commutative, associative and distributive rules. These 
are only some of the algebraic rules for manipulating vectors by addition 
and scaling. A complete list of these rules, which apply whether or not the 
vectors are confined to a plane, is given below. 


Algebraic rules for scaling and adding vectors 


Let a, b and c be vectors, and let m, m, and mg be scalars. 

1 Addition is commutative: a+ b=b-+a. 

2 Addition is associative: (a+b)+c=a+(b+c). 

3 ma is a vector with magnitude |m||a|, in the same direction as a 
when m > 0 and in the opposite direction when m < 0. 
Scaling is associative: m (mga) = (mimg)a. 


4 

5 Scaling is distributive: (m1 + m2)a = mya + maa. 

6 Scaling is distributive over vector addition: m(a+b) = ma-+ mb. 
7 


Addition and scaling involving the zero vector are as expected: 
O0+a=aand 0a=0. 
8 Subtraction is defined by a—b=a-+(—l)b. 


These rules allow us to manipulate algebraic expressions involving scalings 
and vector addition in a familiar way. 


Example 2.2 
Simplify the expression 


2(a+ b) + 3(b+c) —5(at+b-—c). 
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ya 
P, 40 
c c 
O aP, x 
Figure 2.11 


Notice that these rules say 
nothing about the 
multiplication of one vector 
by another: vector 
multiplication is defined in 
Section 4. Nor has anything 
been said about division by a 
vector: in fact, division by a 
vector is not defined. 


A more abstract approach 
would be to define a vector to 
be something that obeys 
these rules, then explore the 
consequences. This is the 
approach taken in the 
second-level pure 
mathematics course. 


Section 2 Scaling and adding vectors 


Solution 


Strict use of the rules requires us to write the expression solely in terms of 
addition. So we have 


2(a+ b) + 3(b+ cc) —5(a+b—c) 


= 2(a+ b) + 3(b+€) + (—5)(a+b+ (—1)c) (using Rule 8) 

= 2(a+ b) + 3(b+€) + (—5)((at+ b) + (—1)e) (using Rule 2) 

= 2a+ 2b+ 3b+ 3c + (—5)(a+ b) + (—5)((—1)c) (using Rule 6 three times) 
= 2a+ 2b+ 3b+ 3c + (—5)(a+ b) + 5c (using Rule 4) 

= 2a+ 2b+ 3b+ 3c + (—5)a+ (—5)b + 5c (using Rule 6) 

= 2a+ (—5)a+ 2b+ 3b+ (—5)b+ 3c + 5c (using Rule 1 several times) 
= (2—5)a+(2+3—5)b+(3+4+5)c (using Rule 5 four times) 
= (-3)a+0+ 8c (using Rule 7) 

= (-3)a+ 8c (using Rule 7) 

= 8c +(-3)a (using Rule 1) 

= 8c + (—1)(3a) (using Rule 4) 

= 8c — 3a (using Rule 8). 


However, because Rules 1, 2, 4, 5, 6, 7 and 8 are exactly the same as 
the familiar rules for manipulating algebraic expressions involving scalar 
quantities, we would usually write the solution more succinctly as 


2(a+ b) + 3(b+c) —5(at+b—c) 
= 2a+ 2b+ 3b+ 3c — 5a—5b+ 5c 
= (2—5)a+(2+3-—5)b+(34+5)c 
=8c-—3a. Hf 


In the following exercises, use the more succinct method. 


Exercise 2.7 
Simplify the expression 4(a — c) + 3(c — b) + 2(2a — b — 3c). 


*Exercise 2.8 


Find the vector x in terms of a and b in the following vector equations. 
(a) 2b+4x=7a (b) n(b —a) + x=m(a—b) 


In Subsection 1.1 a vector was defined as a quantity having a magnitude and 
a direction. In fact, this definition is incomplete in that it does not include 
a rule for combining two such quantities. Hence a complete definition of a 
vector is as follows. 


Vectors 

e A vector has magnitude and direction. 

e Any two vectors can be added by the triangle rule. 

e A vector can be scaled by a real number in such a way that the 
above rules apply. 


It is a rather surprising fact that so many physical quantities — displace- 
ment, velocity, acceleration, force, torque, momentum, to name but a few — 
all qualify as vectors under the above simple definition. This is one reason 
why the subject of vectors is so important. 
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End-of-section Exercises 


Exercise 2.9 


The vectors a and b are represented by the arrows shown in Figure 2.12. 
The magnitudes of a and b are 4 and 6, respectively. Draw a sketch to show 
the vectors a+ b, a—b and 2a+ xb. 


Exercise 2.10 


If v = —4.7u, what can you say about the magnitude and direction of v in 
terms of the magnitude and direction of the non-zero vector u? 


Exercise 2.11 


If ABCD is a quadrilateral, with AB denoting the displacement vector from 
A to B, and BC ; CD: DA defined similarly, show that 


AB+BC+CDB+DA=0. 


*Exercise 2.12 


Two vectors p and q are defined in polar form: p = (3, 5), q = (4,7). Sketch 
p, gq and p+q, and give the polar forms of 5p, —q and p+ q. 


Exercise 2.13 


(a) Which of the following proposed general rules is true for the scalar 
multiplication of a vector in polar form? (Assume m > 0.) 


mr, ¢) = (mr, 4), 


mr, 6) = (mr,md). 


(b) Does the following proposed general rule hold for the addition of vectors 
in polar form? 


(71,1) + (r2, bo) = (ri + 72, 4 + $9) 


3 Cartesian components of a vector 


So far we have approached vectors, and the laws of vector addition and scal- 
ing, geometrically. To add vectors geometrically requires drawing diagrams 
representing the vectors by arrows. An alternative, and sometimes more 
convenient, algebraic approach to representing vectors is developed in this 
section, first in two dimensions and then in three. 


3.1 Vectors in two dimensions 


We have already seen in Subsection 2.1 how to write vectors that are parallel 
to the z-axis or y-axis as scalar multiples of the Cartesian unit vectors i 
and j, respectively. (Recall that i and j are the unit vectors in the directions 
of the positive xz- and y-axes, respectively.) Of course, in general, vectors 
do not lie parallel to either the z-axis or the y-axis. However, you will see 
in this subsection how use of the rules for vector addition and for scalar 
multiplication allows us to write any vector in the (z, y)-plane in terms of 
the Cartesian unit vectors i and j. 


Consider an arbitrary vector a = OA in the (x, y)-plane, whose tail is at 
the origin O, as shown in Figure 3.1. The vector OA is called the position 
vector of the point A, and its endpoint is determined by the Cartesian 
coordinates a, and ag of A, i.e. by the distances OB and OC, respectively. 
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yA 
b 
a a 
O x 
Figure 2.12 


A= (a, a>) 


Figure 3.1 


Note that if A were in one of 
the other three quadrants of 
the plane, then one or both of 
a1, @2 would be negative. 


Section 3 Cartesian components of a vector 


Furthermore, the vectors OB and OC can be written as scalings of the 
Cartesian unit vectors i and j: 


OB = aii and OC = aj. 


Hence the triangle rule (or parallelogram rule) for the addition of vectors 
allows the vector a to be expressed as the sum of OB and OC , Le. as 


a=OB+0C or a= ajzit aaj. 


The latter is called the component form of a, and the numbers a; and a2 You may also see these 


are called the i- and j-components of a, respectively. numbers referred to as the 
x- and y-components of a. 
When the tail of the vector a is not at the origin, its components are defined 


in an obvious way. 


Figure 3.2 


Referring to Figure 3.2, the components of a are 
a=qa—pi and ag=q-— pp. 


A shorter way of writing a vector in component form is as an ordered pair 
of numbers, (a1, a2), where the unit vectors i and j are not shown explicitly. 
This notation needs to be used with care because the coordinates of a point 
in a plane are also denoted in this way, and vectors are conceptually different 
from points. To avoid such confusion, in this course the column vector 
notation 
ay This is the way in which 
a= | | many computer algebra 


packages display vectors. 
will be used instead. In the text, to save space, the column vector will often 


be written as a= [a, ay], where the transpose symbol T here changes the 
row into a column. 


Definition 


A vector a = P@ in the (x, y)-plane, where P is the point (pj), p2) and 
Q is the point (q,q2), has component form 


a= ayzi+t aoj, 


where a, = qi — p1 and ag = q2 — pa, and i and j are the Cartesian unit 
vectors. 


The component form may also be written as 


a= |" or a=[a, ay]?. 
ag 


The numbers a; and az are the (Cartesian) components of a. 
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*Exercise 3.1 


Write each of the vectors in Figure 1.7 (page 158) in the form a = aji+ aaj 
and as a column vector. 


The magnitude of a vector given in component form is found very easily. 
For example, the magnitude of the vector a in Figure 3.2 is just the length 
of the line PQ. This is found by Pythagoras’s Theorem to be \/az + a3. 


Magnitude of a two-dimensional vector in component form 


If a= PQ = ayi+ aaj, where P and Q have coordinates (p;,p2) and 
(q1, 92), respectively, then 


jal = 4/0? + a2 = (qi — pi)? + (a2 — po)?. 


Vectors in component form can also be added and scaled very easily, by 
making use of the algebraic rules for the scaling and adding of vectors. For 
example, 


at b = (ayi + aaj) + (b1i + bj) 
ayzi + aaj + bi + boj 

= (ayi + by i) + (aaj + baj) 
= (a1 + bi)i+ (a2 + b2)j. 


So, to add two vectors one adds their respective components. Similarly, 
ma = m(ayi + aaj) 
= (may)i + (mag)j, 


so scaling a vector is achieved by scaling its components. 


Adding and scaling two-dimensional vectors in component 
form 


If a= ayi+ aaj, b = bji+ boj and m is a scalar, then 
a+b = (a; + b1)i + (ag + b2)j 
and 
ma = (maj,)i+ (mag)j. 
Equivalently, using column vector notation, 
[| * [2] = [moa] 
a2 bg az + be 


and 


Exercise 3.2 


Figure 3.3 shows four vectors in the (a, y)-plane. 
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Figure 3.3 


(a) Write down the vectors in component form. 


(b) Draw a diagram to verify that the scaling 3.5a is the same when obtained 
geometrically or algebraically using components. 

(c) Use the triangle rule to obtain the vector a+ b. Verify that this vector 
is the same as that obtained by adding the component forms of a and b. 


Find, algebraically, the components of the vector 2a + b—c. Hence find 
the magnitude of the vector 2a+b-—c. 


*Exercise 3.3 


(a) Find the numbers p and q if r = baal s= 7 and r=s. 


(b) Find the magnitude of the vector t if t = u+v, where u = os | 


il 
and v= = eth 


Exercise 3.4 


The three vectors a, b and c in Figure 3.4 are specified in polar coordinates 
by 


T 3m T 
a= (2,3); b=(3,t), ¢=(1§)- 
(a) What are the magnitudes of the three vectors? 
(b) Write down the vectors a, b and c in terms of i and j. 


(c) Obtain the vector a+ c in terms of i and j. 


In Exercise 3.4 the vectors were given in polar coordinate form, which is 
just a systematic way of specifying magnitude and direction. The process 
of finding the Cartesian components of a vector given its magnitude and 
direction is known as resolving a vector into its components. This is 
essentially what you did in Exercise 3.4(b). Thus given the magnitude |a| 
of the vector a in Figure 3.5, and its direction @, we can resolve it into its 
components: 


a, =|alcos@ and ag = |alsing. 


Conversely, given the components a; and ag of a vector, we can specify its 
magnitude and direction: 


lal = (aj +43)", cose =ar/lal, sing = a2/lal. 


y 
b 
a 
c 
A x 
Figure 3.4 
y 
a 
j 
¢ 
i x 
Figure 3.5 


You had practice at doing 
these calculations in 
Subsection 1.5. 
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You will see this idea again in Section 4. For now, note that if you wish to 
add two vectors in polar form it will be necessary first to resolve them into 
their Cartesian components (since there is no convenient formula for vector 
addition in polar coordinates). 


Exercise 3.5 


(a) Resolve the vector v of magnitude 5 shown in Figure 3.6 into its Carte- 
sian components. 


(b) Find the magnitudes and directions of the vectors 


a=V3i-j and b= —3i+3j. 


3.2 Vectors in three dimensions 


Thus far we have discussed vectors in the plane, reaching the component 
representation of such vectors in the previous subsection. However, the 
world is three-dimensional, and few real problems are restricted to a plane 
surface. For example, starting at point A at one corner of the cube shown in 
Figure 3.7, you can reach the opposite corner S' by three successive displace- 
ments: AG + OB + BS. In order to work with such addition of displace- 
ments in three dimensions, it is necessary to introduce a three-dimensional 
coordinate system. 


A three-dimensional Cartesian coordinate system 


Consider a two-dimensional Cartesian coordinate system Oxy. Draw a third 

axis, the z-axis, through the origin O, perpendicular to both the x- and y- 

axes of the two-dimensional system. This produces a coordinate system with 

three mutually perpendicular axes, the x-, y- and z-axes (see Figure 3.8), 

intersecting at O. Alternatively, the coordinate system can be characterized 

by three planes: 

e the (a, y)-plane, which contains the z- and y-axes and is perpendicular 
to the z-axis; 

e the (x, z)-plane, analogously defined; 

e the (y, z)-plane, again analogously defined. 

Any point P can be represented uniquely by its perpendicular distances from 


the (x, y)-, (x, z)- and (y, z)-planes. These distances, called the (Cartesian) 
coordinates of P, are shown in Figure 3.8. 


z-axisA ZA 
(y, z)-plane 
gC 4z--- > > > a 
Be Se Sy “ oe I 
p zB es, 4 3 (2, 3; 4) ¢ | 
p @ 4 ge | 
(* — ————_ e | P | 
Iie ad | | 2- ] } 
| P3 \P | | | | 
| I } 
1- ] 
| O | | | | 
> O | J > 
: Pr, 7B y-axis 112,73 y 
i Pi ly 1 a 
eee ee BiKe ee ee en 
A Q 2 
AJ) pale 
X-axis x 
Figure 3.8 Figure 3.9 
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Figure 3.6 


Figure 3.7 


The z-axes shown in 


Figures 3.8 and 3.9 are meant 


to point out of the plane of 
the page. 


Section 3 Cartesian components of a vector 


QP, RP and SP are perpendicular to the (z,y)-plane, (x, z)-plane and 
(y, z)-plane, respectively. 


We denote the point P by the ordered triple of coordinates (p1, p2, p3), where 


pi = SP =OA, 
po = RP = OB, 
P3 = QP= OC. 


For example, the point (2,3,4) is shown in Figure 3.9. 


When drawing Figure 3.9 it was necessary to choose one of two possible ways 
for the positive z-direction to be defined; these are shown in Figure 3.10, 
where in both cases the y-axis is meant to point into the plane of the page, 
away from you. 


The usual convention for relating the positive directions of x, y and z is given 
by the following rule, called the right-hand rule. The right hand is held 
with the middle finger, first finger and thumb placed (roughly) perpendicular 
to each other, and the other two fingers closed (see Figure 3.11). If the 
thumb and first finger are pointing in the directions of the positive «- and 
y-axes, respectively, then the middle finger is pointing in the direction of 
the positive z-axis. 


Alternatively, you can think of Figure 3.9 as showing a corner of a room 
(with the z-axis pointing upwards). If you are standing in the corner facing 
outwards, then the left-hand edge of the floor is the y-axis, and the right- 
hand edge is the x-axis. A coordinate system defined in this way is called 
a right-handed system. Only right-handed systems will be used in this 
course. The systems drawn in Figure 3.9 and the top of Figure 3.10 are 
right-handed systems. 


positive z-direction 


gle 


positive x-direction 


positive y-direction 


Figure 3.11 


An alternative definition of the same positive z-direction is given by the 
screw rule, stated as follows. Suppose that we are turning a screw into 
a piece of wood; then a clockwise rotation makes the screw move into the 
wood (see Figure 3.12(a)). If we turn the screw in the sense from x to y 
as shown in Figure 3.12(b), then the direction in which the screw moves is 
along the positive z-direction. 


For the rest of this unit the screw rule will be used to characterize a right- 


positive z 


positive z 


Figure 3.10 


(a) 


x 


(b) 


clockwise 


turn 


handed system, but you should use whichever rule you find easier to apply. Figure 3.12 


a 
turn x to y 
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Exercise 3.6 


Decide which of the sets of perpendicular axes in Figure 3.13 define right- 
handed coordinate systems. 


(c) 


Figure 3.13 


(The z-axis points out of the plane of the paper in (a) and (b). The z-axis 


points into and out of the plane of the paper in (c) and (d), respectively.) 


The component form of three-dimensional vectors 


The algebraic representation of vectors can be extended to vectors in three 
dimensions, such as in Figure 3.14. The vector a, drawn from the origin O, is 
the position vector of point A with three-dimensional Cartesian coordinates 
(a1, @2,a3). A third Cartesian unit vector k is introduced to represent the 
positive z-direction. We now have three Cartesian unit vectors, i, j and k, 
which are perpendicular to each other. The vector a may thus be written 
in component form as 
ay 
a=aji+a2jt+az3k or a= | ag or a= |a, a2 43] 
a3 


Definition 


The position vector of a point A relative to the origin O of three- 
dimensional space is the displacement of A from O, i.e. the vector 


a= OA. 


The i-, j- and k-components of the position vector a are the coordinates 
a1, @2 and ag of the point A, respectively. 


The components of vectors not based at the origin are defined similarly, as 
follows. 


Definition 
A vector a= PO in three-dimensional space, where P is the point 
(p1,P2,p3) and Q is the point (q1, q2,q3), has component form 

a= aji+ agj + azk, 


where a, = q1 — Pi, @2 = q2 — p2, 43 = q3 — p3, and i, j, k are the 
Cartesian unit vectors. The numbers aj, a2, a3 are the (Cartesian) 
components of a. 
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Figure 3.14 


These may sometimes be 
referred to as x-, y- and 
z-components. 


Note that the component 
form may also be written as 


ay 
a=} a2 
a3 


or 


Section 3 


As in two dimensions, the operations of vector algebra can be expressed in 
terms of components. 


Adding and scaling three-dimensional vectors in component 
form 


If a = ayi+ aoj + agk, b = 11+ boj + b3k and m is a scalar, then 
at+b= (ay + bi )i + (a2 + b3)j + (a3 + b3)k 
and 


ma = (maj)i+ (mag)j + (maz)k. 


The magnitude of a vector in terms of its components a1, a2, a3 can be found 
using Pythagoras’s Theorem (see Figure 3.15). The length ON is \/at + a3, 
and OA? = ON? + NA?. But OA = |al, thus 


la] = \/a? +. a3 +2. 


x 


Figure 3.15 


This can be summarized as follows. 


Magnitude of a three-dimensional vector in component form 


Ifa= PO = a,i+ aaj + ag3k, where the points P and Q have coordi- 
nates (p1,p2,p3) and (q1, 42,43), respectively, then 


lal = fo} + of +03 = Ml E+ Ue + Pol. 


*Exercise 3.7 
Given vectors a=i+j+k, b= 2i-—3j—kandc=3i+k: 


(a) express d = 2a — 3b and e = a — 2b + 4c in component form; 


( 


b) 
(c) evaluate |a|, and write down a unit vector in the direction of a; 
(d) 


Exercise 3.8 


find the magnitudes of the vectors d and e; 


find the components of a vector x such that a+ x = b. 


1 2 
Find the magnitude of the vector p=3|0] — 3 
6 —1 


Cartesian components of a vector 
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Vector equation of a straight line 


One useful application of position vectors (in two or three dimensions) is in 
obtaining a vector equation of a straight line. 
Example 3.1 


Find the position vector of a point T lying on the straight-line segment PQ 
(see Figure 3.16) in terms of the position vectors of P and Q. 


Solution 


Let T be any point on PQ (see Figure 3.16). The position vector OT of T 
relative to the origin can also be written, using the triangle rule, as 


OT = OP + PT. 


Now PT = sPQ, for some number s, and the point 7’ traces out the line 
segment PQ as s varies from 0 to 1. Thus the straight-line segment PQ is 
described by the vector equation 


OT =OP+sPQ (0<s<1). 


Writing p = OP, q= OO, i= OT, and noting (using the triangle rule) that 
PO = OO _~OP= q — Pp, this equation can also be written as 


t=p+s(q-—p)=(1-s)p+sq (O<s<1). Ef 


Note that if the parameter s in Example 3.1 is allowed to range over all the 
real numbers (—co < s < oo), then the point T traces out the entire straight 
line of which PQ is a segment. Also note that the ideas in Example 3.1 are 
easily extended to three dimensions. 


Vector equation of a straight line 


If P and Q are any two distinct points on a straight line in space, 
with position vectors p and q, respectively, with respect to some given 
origin, then the vector equation of the straight line is 


t=(1-—s)p+sq (-oo<s<oo), 


where t represents the position vector of any point on the line. 


*Exercise 3.9 


Write down, in component form, the vector equation of the straight line on 
which lie the points with Cartesian coordinates (1,1,2) and (2,3, 1). 


End-of-section Exercises 


Exercise 3.10 

Let a = 2i —j, b= 1+ 3j + 5k and c =j — 2k. 

(a) Find the magnitudes of a and b, and describe the direction of a. 

(b) Find the vectors a+ b, 2a — b and c+ 2b — 3a in component form. 

(c) What is the endpoint Q of the displacement represented by the vector 
2a — b if (0, 2,3) is its beginning point P? 

Exercise 3.11 


Write the vectors 0, i, j and k as column vectors in three dimensions. 
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yA Pp 
T 
p 
Q 
q 
O 
Figure 3.16 


If0<s< 1, then the 
equation represents only the 
line segment PQ. 


av 


4 Products of vectors 


So far in this unit we have defined two algebraic operations: vector addition 
(by the triangle rule) and the scaling of a vector. The addition of vectors can 
be usefully applied only to two vectors representing the same type of physical 
quantity. For example, the addition of a displacement and a velocity has no 
physical meaning. However, vectors representing the same or different types 
of physical quantities can be combined in operations that are called the dot 
product and the cross product. They are called products because in some 
respects they behave like ‘multiplications’ in the algebra of real numbers. 
Dot products and cross products of vectors have numerous applications in 
geometry, mechanics and electromagnetism. 


In this section the dot product and cross product are defined geometrically 
and also in terms of components of vectors. The dot product of two vectors 
is interpreted in terms of projecting a shadow of one vector onto another, 
and is applied to the problem of finding the angle between two vectors or 
lines. The cross product of two vectors is interpreted as a vector whose 
magnitude is an area. Both dot and cross products can be used in problems 
involving finding the areas of plane figures and the volumes of solid objects. 


4.1 The dot product 


Definition 
The dot product of two vectors a and b is 
a-b = |al|b|cos8, 


where 0 (0 < 0 < 7) is the angle between the directions of a and b (see 
Figure 4.1). 


The dot product of two vectors is a scalar quantity, i.e. it is a real number: 
a-b is the product of the three scalars |a|, |b] and cos 6. So the operation of 
the dot product combines two vectors to define a scalar, and for this reason 
the dot product is also called the scalar product. The angle @ lies in the 
range 0 <@<7: the value of a-b is positive for 0 < @ < 5, ie. when @ is 
an acute angle; the value of a+b is negative for 5 <@ <7, ie. when @ is 
obtuse; the value of a+b is zero for 0 = 4, i.e. when @ is a right angle. 


It is important, when writing a dot product, to make sure that the dot 
between the vectors is clear. 


*Exercise 4.1 


Three vectors a, b and c of magnitudes 2, 4 and 1 units, respectively, lying 

in the same plane, are represented by arrows as shown in Figure 4.2. The 

angle between the vectors a and b is 3 radians, and that between the vectors 
Tw 


b and c is & radians. Use the definition of dot product to find the values of 
a+b, b-c,a-cand b-b. 


Section 4 Products of vectors 


The product a:b is read as 


‘a dot b’. 


Figure 4.1 


Figure 4.2 
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This exercise demonstrates two important properties of the dot product. 
(a) If two vectors a and b are perpendicular to each other (i.e. the angle 
between them is 5 radians), then since cos 5 = 0, 
a+b = |al| |b] cos 5 = 0. 
(b) The dot product of a vector with itself gives the square of the magnitude 
of the vector, i.e. 
a-a= |a||a|cos0 = |al”. 


The converse of (a) also holds: if a and b are two non-zero vectors such 
that a+b = 0, tne the definition of the dot product tells us that cos @ = 0; 
therefore 6 = 5 and the vectors are perpendicular. 


In the sae of real numbers, xy = 0 implies that either x or y (or both) 
is zero. In contrast, for the dot product, a- b = 0 gives an extra possibility: 
either a or b (or both) is the zero vector, or the angle between a and b is 
5 radians. 


Properties of the dot product 


The following are some important properties of the dot product of two vec- 
tors. They include the rules for manipulating dot products in algebraic 
expressions. 


Properties of the dot product 


Let a, b and c be vectors, and let m be a scalar. 

1 a-bisascalar. 

2 a-b=b-<a, i.e. the dot product is commutative. 

3 a-(b+c)=a-b+a-cand(a+b)-c=a-c+b-<e, ie. the dot 
product is distributive over vector addition. 

4 (ma)-b=m(a-b) =a: (mb), ie. a scalar can be ‘moved through’ 
a dot product. 

5 If neither a nor b is the zero vector, then a- b = 0 if and only ifa 
is perpendicular to b. 


fo) 


a-a=|al’. 


The following example shows how these properties can be used to simplify 
expressions. 
Example 4.1 


Expand the expression x- y, given that x = 2u+ v and y = u— 5v. Cal- 
culate its value when u and v are perpendicular unit vectors. 


Solution 
= (2u+ v)-(u— 5v) 
= (2u) - (u—5v)+v- (u-— 5v) (Property 3) 
= (2u)-u+ (2u)-(—5v) +v-u+v-(—5v) (Property 3) 
= 2(u-u) — 10(u-v) +v-u—5(v-v) (Property 4) 
= 2(u-u) — 9(u- v) — 5(v- vv) (Property 2) 


Now u-u = |u|? =1 and v- v =|v|? = 1 when u and v are unit vectors. 
Furthermore, u+-v = 0 when u and v are perpendicular vectors. So when u 
and v are perpendicular unit vectors, we have 


x-y=2-0-5=-3. Hf 
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These properties can all be 
derived from the definition of 
the dot product, but the 
derivations are not given here. 


*Exercise 4.2 
(a) Expand the expression (a+ b)-+(a—b). 
(b) Expand the expression |a + b|?. 


Exercise 4.3 
Given that a and b are perpendicular unit vectors: 


(a) find the value of m such that the two vectors 2a + 3b and ma + b are 
perpendicular; 


(b) find the value of |c| if ce = 3a+ 5b. 


Finally, a word of caution: (a-b)c is not in general the same as a(b~c). 
The vector (a+ b)c is a scaling of c by the number a- b, whereas a(b-c) isa 
scaling of a by the number b-c. Clearly these two vectors are not generally 
even parallel, let alone equal. For example, if a= b =i and c = j, then 


(a-b)c=(i-i)j=j but a(b-c) =i(i-j) =O. 


The component form of the dot product 


We saw in Section 3 that an arbitrary vector a in three dimensions may be 
expressed in terms of the Cartesian unit vectors as 


a = aji+t+ aj + agk = | ag 


With this representation, vector addition and scaling become simple alge- 
braic operations without any reference to diagrams. The definition of the 
dot product was expressed in terms of the magnitudes of two vectors and 
the angle between them. We shall now see how to express the dot product 
in terms of components of vectors. 


First observe that, by definition, i, j and k are unit vectors and are perpen- 
dicular to one another (see Figure 4.3). Thus: 

i-k=k-i=0, j-k=k-j=0; 

k-k=1. 


i-j=j-i=0, 
J -j = ils 
If two vectors a and b have component forms a = aji+ aaj + a3k and 
b = b1i+ boj + bgk, then the dot product of a and b may be written as 


(ayi + a2j + ask) : (byi + boj + bsk). 


We can now apply Properties 3 and 4 of the dot product and the above rules 
for combining i, j and k to this expression to obtain a very simple formula 
for the dot product of vectors in component form. Specifically, we have 


(ayi + a2j + agk) + (byi + boj + b3k) 


ae i. 


Section 4 Products of vectors 


Recall that jal? =a-a. 


In general, if m is a scalar 
and a is a vector, we can 
write ma or am as 
convenient, although ma is 
more usual; thus a(b - c) 
means the same as (b- c)a. 


Figure 4.3 


Note that for the 
right-handed system shown, 
the unit vector i points out of 
the plane of the page towards 
you. 


= ayzi- bhi+ azi- baj + ayzi- b3k 
+ aaj + b1i + aaj + boj + aoj + b3k 
+ agk + byi + a3k + boj + agk - b3k 
= b(t) Gy batt) PGbslixk) 
+ agbi(j +i) + aobo(j -j) + a2bs(j- k) 
4 Gaby (k= t) -asba (= 9) aabe(k=10) 
= ab + agb2 + azb3. 
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This extremely important formula is worth remembering. 


Component form of the dot product 
If a= ayi+ aaj + a3k and b = bi + bj + b3k, then 
a-b = a,b + agbe + azbs. 


*Exercise 4.4 


Ifa = 4i+ j — 5k and b =i-—3j+k, show that a- b = —4. What does the 
negative sign tell us? 


The angle between two vectors 


The component form of the dot product has an important application in 
calculating the angle between two vectors. You have already seen that if 
a+b = 0 and neither a nor b is zero, then a and b are perpendicular. For 
instance, if a = 2i—j and b = 2i+ 4j, then a- b = (2 x 2) + (-1 x 4) =0, 
so the angle between a and b is 5 radians. In general, the equation defining 
the dot product of a and b, i.e. a- b = |a| |b| cos 0, gives the following simple 
expression for finding the angle between a and b. 


Angle between two vectors 
The angle 0 between any two non-zero vectors a and b is given by 


a:b a,b, + agb2 + a3b3 
cos 0 = = 


lallb] — /az + a3 + a3 /bt +03 + 03” 
where 0 <@0< 7. 


Example 4.2 

(a) Find the angle between the vector a = i+ /3k and the a-axis. 

(b) Find the angle between the vectors a = i+ V3k and b = V3i— 2j + 3k. 
(c) Show that ¢ = —2V/3i + 2k is perpendicular to a = i+ V3k. 


Solution 


(a) The direction of the x-axis is the same as the direction of i, and the 
angle 6 between a and i is given by 
a-i ay, _ 1 _ 1 


jafiJ fal VT+3 2 


Thus the angle between a and the z-axis is 3 radians. 


(b) We have |ja| = /14+3 = 2, |b] = V3 +44+9=4 and 
a-b=(1 x V3)+ (0 x —2) + (V3 x 3) = 4V3. 
Therefore the angle 0 between a and b is given by 


_ 4v3 vB 
Ox4° 2° 


so d= e radians. 


cos 0 = 


os 0 
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(c) To test whether a and c are perpendicular, we calculate their dot prod- 
uct: 


a-c = (i+ V3k) - (—2V3i + 2k) 
= (1% —9/3)4+ (0x 0) + (/5 x2) 
=. 


Since a-c = 0 and a and € are non-zero vectors, c is perpendicular 
toa. 


*Exercise 4.5 
Consider the vectors 
a=2i-3j+k and b=-—i+ 2j+ 4k. 


Find the magnitudes of a and b, and the angle between them. 


Resolving a vector into components 
The dot product has a useful geometric interpretation. 


*Exercise 4.6 


If a = ayi+ aoj + agk, find the values of a- i, a-j anda-k. 


The solution to Exercise 4.6 shows the important fact that the i-component 
of any vector a may be found by taking the dot product a-i. The j- and 
k-components can be found similarly (by taking dot products with j and k, 
respectively). 


We can also find the components of a vector in other directions. Suppose 
that a vector a, represented by OA, makes an angle 6 with a unit vector u 
(see Figure 4.4). Draw the line AP perpendicular to the direction of u. 
Then the distance OP is seen from simple trigonometry to be |a| cos 8. Now 
observe that the dot product of a and u is 


a-u=|a||u|cos?=|OP| (since |u| = 1). 


The distance OP represents the component of a in the direction of u. 


Definition 


The component of a vector a in the direction of an arbitrary unit 
vector Wis a+ U. 


*Exercise 4.7 
Consider the vectors 
a=2i-3j+k and b=-—i+4 2j+ 4k. 
(a) Which of the following vectors is perpendicular to a? 
c=-i+j+3k, d=-2i+k, e=-i-—j-—k. 


(b) Find the component of the vector a+ 2b in the direction of the line 
joining the origin to the point (1,1, 1). 


Section 4 Products of vectors 


Figure 4.4 


Note that a-u will be 
negative if @ > 5, ie. if P and 
u lie on opposite sides of O. 
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Resolving vectors will be a vital technique in subsequent units, and some- 
times you will need to be able to resolve a vector into components in di- 
rections other than horizontal and vertical. For example, suppose that two 
forces, N and W, are acting at a point on an inclined plane (see Figure 4.5). 
These forces can be represented by vectors, and you will see that it may 
be convenient to take axes as shown, with i pointing up the plane. It is 
then necessary to be able to resolve N and W into components along and 
perpendicular to the plane. 


The dot product method of obtaining components always works, but a ge- 
ometric view is also useful. This follows because the component of a vector 
a in the direction of a unit vector u is 

a-u = |a|cosJ@, 
where @ is the angle between a and u. We summarize the method as a 
procedure. 


Procedure 4.1 Resolving a vector into components 


Given a vector a and a unit vector u, to find the component of a in 

the direction of u, do the following. 

e Find (usually from a diagram) the angle 6 between a and u (with 
0<@0< a). 

e The component of the vector a in the direction of the unit vector 
u is |a| cos 0. 

e Ifnecessary (for example, if 6 > 5), use the trigonometric formulae 
from the Handbook to simplify the result. 


The following example uses Cartesian unit vectors that are not horizontal 
and vertical. 


Example 4.3 


Suppose that the unit vector i points up a plane which is inclined at an angle 
a to the horizontal, and the unit vector j is perpendicular to the plane, as 
shown in Figure 4.6. Find the i- and j-components of the vectors N and W. 


Solution 
It is easy to resolve the vector N into its component form: 
N = 01+ |N|j = |N]j. 


For the vector W, we note from the geometry of the diagram that the angles 
between W and i, and W and j, are given by 5 + a and m — a, respectively. 
Applying Procedure 4.1 twice (with u =i and then u = j) allows us to 
resolve W into its component form: 


W = [W|cos(F + a) i+ |W] cos(m — a) j 


=—|W|sinai—|W|cosaj. 


So the i- and j-components of N are 0 and |N| respectively, and those of W 
are —|W]|sin a and —|W|cosa. 


*Exercise 4.8 


The two-dimensional vectors v and w in Figure 4.7 have magnitudes 1.5 
and 2, respectively. Resolve v and w into their i- and j-components. 
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Figure 4.5 


Figure 4.6 


Figure 4.7 


Exercise 4.9 


In Figure 4.8 the point P lies on a line making an angle a with the z-axis. 
The vectors a, b, c, d have magnitudes 1, 1.5, 1.5 and 2, respectively, and 
point in the directions shown. 


Figure 4.8 


Resolve each of these vectors into their i- and j-components. 


*Exercise 4.10 


Figure 4.9 shows a configuration similar to Figure 4.8, but with the unit 
vectors i and j aligned along and perpendicular to the line, respectively. 


BY 


Figure 4.9 
Resolve each of the vectors a, b, c and d into their i- and j-components. 


*Exercise 4.11 


The vectors p, q and r in Figure 4.10 have magnitudes 2.5, 3 and 2.5, 
respectively. Resolve p, q and r into their i- and j-components. 


YA 


Figure 4.10 


Section 4 


Products of vectors 
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Exercise 4.12 


The sum of the two-dimensional vectors a, b, c in Figure 4.11 is the zero vec- 
tor, and |c| = 2. By resolving the vectors into their components, determine 
the magnitudes of a and b. 


4.2 The cross product 


You have seen that the dot product of two vectors is a scalar (ie. a real 
number). In contrast, the cross product of two vectors is a vector, whose 
direction is perpendicular to both. The cross product has numerous appli- 


‘ : : : . Fi 4.11 
cations in geometry and mechanics, as you will see later in the course. =e 
Definition 
The cross product of two vectors a and b is 
_ : A The product a X b is read as 

a X b = ({al |b| sin 8) c, nthe es 
where 6 (0 < 6 < 7m) is the angle between the directions of a and b, and 
¢ is a unit vector perpendicular to both a and b, whose sense is given 
by the right-hand screw rule as shown in Figure 4.12. direction 

of screw’s 
motion 


The angle @ between two vectors a and b lies in the range 0 < 0 < 7, so 
sin > 0 and hence |a||b|sin@ > 0. So the cross product of a and b is a 
vector with magnitude |a| |b| sin @ and direction defined by ¢. The direction 
of ¢ is the direction in which the screw in Figure 4.12 would advance when 
turned from a towards b through the angle 6. Notice that ¢ is not defined 
if a and b are parallel or if a or b is the zero vector; but in these cases 
|a| |b| sin 0 = 0 so we take a X b =O. The cross product is also called the 
vector product, which stresses the fact that a x b is a vector. 


The order of writing down a and b is very important. According to the 
screw rule, b X a is a vector in the direction opposite to a x b. Figure 4.13 
shows what would happen to the screw in Figure 4.12 if we turned from b 
to a: it would ‘unscrew’. The unit vector d in the direction of b X a is in 


the opposite sense to €, i.e. d = —c. Hence Figure 4.12 
b X a= (|b| |al sin 0) d = —(|b| |a| sin @)c¢ = —(a x b). 
direction 
*Exercise 4.13 ____ssessSsSsSSSSSSSSSSSSSSSssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssese ore | 


Three vectors u, v and w lie in the (x, y)-plane. Their magnitudes are 2, 3 
and 4 units, respectively, their directions make angles §, 3 and & radians, 
respectively, with the positive x-axis, and they have positive j-components. 
Use the definition of the cross product to find the vectors u X v, u X w and 


VX Ww. 


Exercise 4.13 illustrates an important property of the cross product. If two 
vectors a and b are parallel, then the angle 6 between their directions is zero 
or 7 radians, so the cross product of a and b is the zero vector, because the 
magnitude of the vector, i.e. |a| |b| sin 0, is zero. The converse also holds: if 
a and b are two non-zero vectors such that a x b = 0, then the definition 
of the cross product tells us that sin 6 = 0; therefore 6 = 0 or 0 = a, and the 
vectors are parallel. We can also deduce that 


Figure 4.13 


axa=0O_ for any vector a. 
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So we can test for perpendicular vectors by using the dot product and for 
parallel vectors by using the cross product. 


Properties of the cross product 


The following are some important properties of the cross product of two 
vectors. They include the rules for manipulating cross products in algebraic 
expressions. 


Properties of the cross product 


Let a, b and c be vectors, and let m be a scalar. 

1 ax bisa vector. 

2 bxa=-—(axb). 

3 aX (b+c)=(a X b)+(a X c) and (a+b) X c=(a X c)+(b X Cc), 
i.e. the cross product is distributive over vector addition. 


4 (ma) xX b= m(a xX b) =a X (mb), ie. a scalar can be ‘moved 
through’ a cross product. 


5 If neither a nor b is the zero vector, then a x b = 0 if and only if 
a and b are parallel. 

6 axa=0. 

7 In general, a x (bX c) #(axXb) Xc. 


These properties can all be derived from the definition of the cross product, 
but the derivations are not given here. Note in particular Property 2: the 
cross product is not commutative — the order does matter. 


The component form of the cross product 


*Exercise 4.14 
(a) Show that ix j=k,j x k=iandkxi=j. 
(b) Calculate j x i,k x jandixk. 
(c) Calculate i x i, j x jandk xk. 
(d) Expand and simplify 
(i+k) x (i+j+k) and (ix (i+k)) —-((i+j) xk). 


The cyclic pattern of the products i x j, j x k, k x i and of the products 
ix k, k x j, j x i, as demonstrated in Exercise 4.14, can be remembered 
using Figure 4.14. For example, if we go round the circle clockwise starting 
at i, we have 


ixj=k, jxk=i, kxi=j. ' 
However, if we go in an anticlockwise direction, the cross products are neg- 
ative: 
k j 
ixk=, kxj=i gxia—e Ms 


If two vectors a and b have component forms a = aji+ a2j+a3k and Figure 4.14 
b = bji+ b2j + b3k, then the cross product a xX b may be written as 
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axb= (ayi + agj + ask) x (bi + boj + bsk) 
ai X (by i + boj + bsk) 
+ a2j xX (b1i + bej + b3k) 

+ agk X (b1i + boj + bsk) 
ayi X bi + ayi X bej + ayi X b3k 
+ aaj X byi + aaj X bej + aaj X 3k 

+ ask X bi + agk X boj + agk x b3k 
= aybo(i x j) + ayb3(i x k) 

+ aby (j x i) + a2b3(j x k) 

+ a3bi(k x i) + a3bo(k x j) 

= (a2b3 = a3b2)i + (a3by — a,b3)j 


+ (ayb2 = a2b,)k 
We highlight this important formula. 


Component form of the cross product 


If a = ayi+ aaj + agk and b = b1i + boj + b3k, then 


axb= (a2b3 — a3b2)i + (a3by — a1b3)j + (a,b2 _— a2b,)k 
a2b3 = agb2 
= a3b1 —_ a,b3 
abo = andy 


This formula is not easy to remember or use. For this reason, simpler meth- 
ods have been devised, such as the following, which is known as Sarrus’s 
Rule. Given two vectors a= [a, a2 as]? and b=[b; bo bs)", draw a 
tableau with i, j and k in the top row, then repeat i and j. In the second 
row do the same with the components of a, and in the third row with those 
of b. Then following the diagonal lines as shown, and multiplying the en- 
tries, gives the corresponding components of the cross product a x b, which 
are the elements on the fourth row of the tableau. 


a a 
s bo 
ee 


—agbik —azb9i —-a, agb3i a3byj aybok 
(The diagonals pointing to the right yield positive terms, while those point- 


ing to the left have a minus sign.) 


Example 4.4 
Ifa =2i+j—k and b =i-— 3j + 4k, findaxb. 
Solution 


Since a, = 2, ag = 1, a3 = 
above gives 


—1 and b, = 1, bo = —3, b3 = 4, the formula 


ax b= ((1 x 4) — (-1 x —3))i 
+((-1 x 1) — (2x 4))j 

+ ((2 x —3) — (1 x 1))k 
= i— 9j — 7k. 
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(using Property 3) 


(using Property 3) 


(using Properties 4 and 6) 


(using the results above). 


Another quick way to 
evaluate cross products is to 
use determinants. This 
method is introduced in 
Unit 9 when we discuss 
determinants. If you already 
know this method, then we 
suggest that you continue to 
use it. 


Section 4 Products of vectors 


Alternatively, using the tableau, we have 


soa X b =i-— 9j — 7k, as before. I 


*Exercise 4.15 
Ifa = 2i— 3j+k, b = —i+ 2j)+ 4k and c = —4i+ 6j — 2k, findaxb,axc 
and b X c. From your results, what can you say about a and c? 

*Exercise 4.16 


If a= 2i+ 2} +k and b = 4i + 4j — 7k, find a unit vector whose direction 
is perpendicular to the directions of both a and b. 


b 
We close the section, and the unit, with some useful geometric applications 
of the cross product. The following example is the first step. 
Example 4.5 8 
Any two non-zero and non-parallel vectors a and b define a parallelogram, a 
as shown in Figure 4.15. Express the area of the parallelogram in terms of 
ax b. Figure 4.15 
Solution 
The area A of the parallelogram defined by the two vectors a and b is b lal 


the same as the area of the rectangle of height |b|sin@ and width |a| (see 
Figure 4.16). Thus A = |a||b|sin 0, and this is the magnitude of a x b. So 


A=|axb|. 


Area of a parallelogram 


The area of a parallelogram with sides defined by vectors a and b is 
la x bl. 


This idea is easily extended for the area of a triangle. Any two non-zero, 
non-parallel vectors a and b define a triangle (see Figure 4.17). The area of 
this triangle is half that of the corresponding parallelogram, so it is sla x bl. 


Area of a triangle 
The area of a triangle with sides defined by vectors a and b is sla x bl. Figure 4.17 


187 


Unit 4 Vector algebra 


Using the formula for the area of a parallelogram, it is easy to find the 


volume of a parallelepiped (see Figure 4.18). This is given by A parallelepiped is like a 
: . . distorted brick. All of its 
volume of parallelepiped = base area x vertical height h faces are parallelograms. 


=|(ax b)-el. 


Here we have made use of the fact that the base is a parallelogram (assumed 
to be in the (a, y)-plane) defined by the vectors a and b. The base therefore 
has an area equal to the magnitude of a x b. Now the vertical height h 
is the component of the vector c in the direction of the Cartesian unit 
vector k pointing vertically upwards, i.e. it is the z-component of c, given 
by c-k=k-c. So the volume of the parallelepiped is Ja x b|(k-c). But 
the vector product a X b points vertically upwards and can therefore be 
expressed as |a X b| k. Hence the volume of the parallelepiped is 


ja x b|(k-c) = (ja x b|k)-c=(ax b)-c. 


Figure 4.18 


Of course, the scalar (a x b)-c can be negative if one of the defining vec- The scalar quantity 

tors a or b is chosen to be in the opposite direction to the one chosen in (a X b) +c is an example of a 
Figure 4.18, or if the order of the cross product is reversed. The modulus scalar triple product. 

signs in the formula |(a x b) +c] ensure that the volume comes out positive. 


End-of-section Exercises 


Exercise 4.17 

(a) Is a-ba vector? 

(b) Can a- b be negative? 

(c) What is special about a and b if a- b = |a| |b]? 
(d) If a- b=0, what can you say about a and b? 
(e) Ifa x b=0, what can you say about a and b? 


Exercise 4.18 


Suppose that the vectors r and s are directed towards north and north-east, 
respectively, and define r x s = t. 


(a) What is the direction of t? 

(b) In what direction is s x r? 

(c) In what direction is t x r? 

(d) If |r| = |s| = 1, what is |t|? 

(e) Calculate the vector t x (r X s). 

(f) If |r| = |s| = 1, what is the value of r-s? 
( 


g) If |r| = |s| = 1, what is the value of s- (t x r)? 


Exercise 4.19 


Find the value of (a x b)- a for any non-zero vectors a and b. 


188 


Outcomes 


Outcomes 


After studying this unit you should be able to: 


e understand the meaning of the terms scalar, vector, displacement vector, 
unit vector and position vector, and know what it means to say that two 
vectors are equal; 

e use vector notation and represent vectors as arrows on diagrams; 

e use the plane polar coordinate representation of the magnitude and di- 
rection of a vector, and convert between the polar coordinates and the 
Cartesian coordinates of the endpoint of a vector drawn from the origin; 


e scale a vector by a number, and add two vectors geometrically using the 
triangle rule (or the parallelogram rule); 


e resolve a vector into its Cartesian components, and scale and add vectors 
given in Cartesian component form; 


e calculate the dot product (scalar product) and cross product (vector 
product) of two given vectors; 


e determine whether or not two given vectors are perpendicular or parallel 
to one another; 


e determine the magnitude of a vector and the angle between the directions 
of two vectors; 


e write down the vector equation of a given straight line; 
e resolve a vector in a given direction; 


e manipulate vector expressions and equations involving the scaling, ad- 
dition, dot product and cross product of vectors; 


e use the cross product to determine the area of a parallelogram or triangle. 
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Solutions to the exercises 


Section 1 
Ld 
y 
b 
34a 
2 
1 
3 
T 1 i 
O I 2 3 * 
1.2 
NA 
Derby 
57 km 


Leicester 


0 10 = 30 
Scale (km) 


Birmingham 


1.3 


70 mph 
60° 


0 10 20 30 40 50 60 70 
Scale (mph) 


1.4 f=b, as both are of length 2 units and both point 
in the positive y-direction. 


1.5 The completed table is as follows. 
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Cartesian Polar 
coordinates (x, y) coordinates (r, ¢) 


(0,1) (Le) 
(1,1) (V2, =) 
(24/2, —2,/2) (4,-7) 
(—6,0) (6, 7) 
(—1,-1) — 3m 

(-1,7) 


(2.003 x 107, 9.797 x 107) (108, exp(0.17 )) 


The entry in row 6 is an invalid entry because r must 
be non-negative. 


1.6 The most obvious choice is the Cartesian coordi- 
nate system with origin at Bristol (see Figure 1.3 on 
page 156), the z-axis pointing east ba the y-axis point- 


ing north. Then r = 296 and ¢= 3 — i — Ss. Hence 
s = (296, 2%). Another choice would have the origin at 


Bristol but the z-axis pointing from Bristol to Leeds, 
in which case you would have s = (296,0). (Infinitely 
many other choices are possible.) 


1.7 Scalar quantities: temperature, volume, energy, 
time. 

Vector quantities: velocity, force, displacement, accel- 
eration. 


1.8 Since (0, —3) lies on the negative part of the y-axis, 
we can aura write down the polar coordinates 
of O O@ as (3, —4), so | |OG| = = 3; 
ieee, using the formulae 

r= (0? + (-3)?)? =3, 

sind = —3/3 =-1, cosd¢=0 


gives the same results. 


Section 2 
2.1 (a) Leeds to Bristol: —d. 
Leeds to Leeds: 0. 

(b) (i) 2v (ii) —v 


(c) The vector —1.5v has magnitude 1.5|v| and direc- 
tion opposite to v. 


(iii) 0 


The vector —kv (k positive) has magnitude k|v| and 
direction opposite to v. 
(d) (i) The vectors AB and DC are equal in length 


and parallel, and point the same way (i.e. have the same 
direction). Thus 


AB = DC. 


(ii) The vectors BC and DA are equal in length and 
parallel, but point in opposite directions. Thus 


BC =-DA (or, equivalently, DA= -BC). 


1 1 
(e) —visascaling of v by the positive scalar m = — 


[v| 


The direction of — 


the magnitude is m|v| = 


22 a= Wa 015i c= 9}; d= <j. 


2.3 (a) —35j (where |j] represents 1 km per hour). 
(b) —112i (where |i] represents 1 mile). 


(c) 112i (where |i] represents 1 mile). 


2.4 (a) 


(b) 


20 


ya 


v is thus the same as that of v, and 


Solutions to the exercises 


2.6 The following sketch illustrates the associative 
property. 


(a+b)+e 
=a+(b+c) 


Y 


(To evaluate (a+b)+c, we go first to a+b (in 
the lower quadrant) and then add c. To evaluate 
a+(b-+c), we go first to a (along the x-axis) and then 
add b+ c.) 

The following sketch illustrates the distributive prop- 
erty. 


2(a + b) 
=2a+2b 


2.7 4(a—c) +3(c — b) + 2(2a — b — 3c) 
= 4a — 4c + 3c — 3b+ 4a — 2b — 6c 
= 8a—5b-—7c 


2.8 (a) 2b+ 4x = 7a, therefore 
4x = 7a — 2b, 
so 
x = ta— $b. 
(b) n(b —a) +x =m/(a-— b), therefore 
x = m(a— b) — n(b—a) 
m(a—b) + n(a — b) 
=(m-+n)(a—b). 


I 
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2.9 


2.10 The magnitude of v is 4.7 times the magnitude 
of u. v is parallel to u, but the sense of v is opposite 
to the sense of u, i.e. v and u have opposite directions. 


2.11 By the triangle rule, 
AB + BC = HC, 
CD+DA=CA. 
But CA = —AC, so we have 
AB + BC +0D+DA= AC - AC =0. 


C 


ty 


D 


Alternatively, AB + BC = AC. Hence 
AB + BC + CD = AC +CD=AD, 


so 


AB+BC+CD+DA=AD+DA=0, 
since AD = “7A, 


(There are various other possible arguments!) 


2.12 The vectors p, q and p+ q are sketched below. 


5p = (15, 5), 
Since the directions of p and q are at right angles, 
Ip +q| = V32 +42 =5 and $ = § + arctan 4 = 2.498 
radians, so 

p+q= (5, 2.498). 


=o > (4,0). 
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2.13 (a) The first rule is true and the second is false 
(scalar multiplication does not change direction). 


(b) The proposed rule does not hold. (Consider r + r, 
for example, where r = (r,¢). The proposed rule gives 
r+r = (2r,2¢), whereas actually r+r = 2r = (2r, ¢).) 
(There is an algebraic rule for adding vectors in polar 
form, but it is rather unwieldy. This is one reason why 
Section 3 introduces the Cartesian representation of a 
vector, for which there is a simple algebraic rule for the 
addition of vectors.) 


Section 3 

321 asisht of 
b=2j=[0 2) 
ce=i=[1 0]? (=a) 
d=-2i=[-2 oj? 
e=-i+2j=[-1 2]? 
f=2j=(0 2\* (=b) 
g=i-j=[1 -1]7 
h=3i=([3 oj? 


3.2 (a) a=3i+j, b=-i+3j, c= —3i-2j, 
d =3i+j (=a). 


(b) 
Ya 
3:5 


3.5a = 3.5(31+ j) = 10.51 + 3.5j, as in the diagram. 


(c) Ya 


Le 
O 1 y) 3 4: 


a+b = (3i+j) + (—i+ 3j) = 2i+4j, as in the diagram. 

(d) 2a+b—c = 2(31+ j) + (-i+ 3j) — (-3i— 2j) 
= 8i+ 7j. 

Thus |2a+ b —c| = /82 +7? = V113. 


3.3 (a) Given r =s, we can equate the corresponding 
components. Thus 

pt+q=-3 and p—q=7, 
which gives p = 2 and q = —5. 


m enorve a(li]+[)) 


Hence |t| = V2. 


3.4 (a) lal =2, |b| =3, lel =—1. 
(b) Use the formulae x = rcos¢ and y = rsin@ (see 
Subsection 1.5). 


First consider the vector a. The Cartesian components 
of a are the numbers a; and ag given by 


ag = 2sin = = V3. 


ay= 2cos = = 1, 
Thus 
a= ait aaj =it+ V3j. 
Similarly for b and c: 
b = (3cos 3") i+ (3sin 24) j = -4i+ Sj 
= (cos Z)i+ (sin 2) j= v3i 4 + 4). 
(c) We now have 


ate= (i v3i) (414 33) 


=(14 Bis (Vie) 


~ 1.8661 + 2.239). 


3:5 The components are 
= 5cos i ~ 3.214, 
= 5sin > ae aoe 
ies The magnitudes are 
lal = (V3)? + (-1)?)'? =2, 
|b] = ((—3)? + 32)!/2 = 3V/2 ~ 4.243. 
To specify the directions, we need a reference direction. 
Using the plane polar coordinate convention, we can 
specify the angle @ with respect to the positive x-axis. 
Thus, for vector a, 
cos ¢ = a1/|a| = V3/2, 
sin = a2/lal = -1/2, 
hence ¢ = —§. 


For vector b, 
cos @ = —3/(3V2) = -1/Vv2, 
sin ¢ = 3/(3V2) = 1/V2, 
hence ¢ = an 


3.6 Systems (b), (c) and (d) are right-handed. 


Solutions to the exercises 


3.7 (a) d= 2(i+j +k) — 3(2i— 3j —k) 
= —4i+ 11j + 5k, 

= (i+ j+k) — 2(2i—3j —k) + 4(31+k) 
= 91+ 7j + 7k. 

(b) |d| = /(—4)? + 112 + 5? = V162 (= 9v2), 
le| = 92+ 724 7? = V179. 

(c) jal = VP?+2P?4+P = V3. 

A unit vector in the direction of a is 


1 
ti hp, 
al? V3! ) 


(d) Ifa+x=b, then 
x = b-—a= (2i-— 3j—k) 
= i-—4j— 2k. 
Thus the components of x are 1, —4 and —2. 


3 2 1 
3.8 p=] 0}-—]|] 3] =]|-38],s0 
18 —l 19 


|p| = (1? + (-3)? + 19?)!/? = 3711? (~ 19.26). 


3.9 Relative to the origin of the Cartesian coordinate 
system, the two points have position vectors i+ j+2k 
and 21+ 3j-+k. Thus the vector equation of the line is 


= (1—s)(i+j+ 2k) + s(2i+3j+k) 


= (1+4.s)i+ (1+ 2s)j+ @ —s)k, 
where —co <5 < ow. 
3.10 (a) |a| = \/2? + (-1)? = V5, 


bl = VETS + BF = VB. 
The vector a lies in the (x, y)-plane, and the angle ¢ 
that it makes with the x-axis is given by cos ¢ = 2//5 
and sin d = —1/\/5. Hence ¢ ~ —0.4636 radians. 
(b) a+b = 3i + 2j + 5k, 
2a — b = 3i — 5j — 5k, 
c+ 2b — 38a = —4i+4+ 10j + 8k. 
(c) The vector PG is equal to 2a — b. The point Q is 
the end of the vector 00, which is given by 
0@ = OP + PQ 
= (2j + 3k) + (383i — 5j — 5k) 
= 3i— 3j — 2k, 
so Q is the point (3, —3, —2). 


3.11 0=[0 0 Oj], 
i=[f o of, 
j=[0 1 Oo, 
ka (0- o. dF, 
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Section 4 


4.1 a-b=|a||b|cos6 =2 x 4x cosZ =4, 


b-c¢ = |b| |c|cos@ = 4 x 1 x cos 2 = 2V3, 


a-c= |a||c|cos@ = 2 x 1 x cos (% + 2) 
=2cos> =0, 


b-b= |b||b| cos = 4 x 4 x cos0 = 16. 


4.2 (a) (a+b)-(a—b) =a-(a—b)+b-(a—b) 
=a-a-—a-b+b-a—b-b 
=a-a—a-b+a-b—b-b 
=a-:a—b-b 

(b) ja+ bl? = (a+b)- (a+b) 

=a-(a+b)+b- (a+b) 
=a-at+a-b+b-a+b-b 
=a-:a+a-b+a-b+b-b 
=a-a+2a-b+b-b 


4.3 (a) If2a+3b and ma+ bare perpendicular, then 
(2a+ 3b) -(ma+b) =0. 

Expanding this expression, 
2ma-a+2a-b+3mb-a+3b-b=0. 


Now a and b are perpendicular, soa: b=b-a=0O, 
and they are unit vectors, soa-a=b-b=1. Thus 


2m+3=0, 
som=-—1.5. 
(b) cl? =c-e 
= (3a + 5b) - (8a + 5b) 
=9a-a+15a-b+15b-a+ 25b-b. 
Thus, since a and b are perpendicular unit vectors, 
|c|? =9 + 25 = 34, 
so |c| = 34 (x 5.831). 


4.4 a-b=(4x1)+(1 x —3)+(-5 x1) =—-4. 
The negative sign tells us that the angle between a and 


b is between 5 and 7 radians, i.e. it is an obtuse angle. 


4.5 |a| = 2? + (-3)?+ P=Vv14, 
|b] = \/(—1)? + 2? + 42 = V21. 


Also, 
a-b= (2x —-1)+(-3 x 2)+(1 x 4) = -4, 

so if @ is the angle between a and b, then 

a-b | —4 _ 4 

jal|b] 14x V2I 76 


The negative sign means that 6 is obtuse, so # ~ 1.806 
radians. 


cos @ = 
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4.6 a-i= (aji+ aoj + ask) -i 
=ajli-i+a2j-it+agk-i 
= a}. 

Similarly, 

a-j=a. and a-k=asz3. 

(Notice that this means that the components of a vec- 

tor are given by the dot products of the vector with the 

Cartesian unit vectors i, j, k.) 


4.7 (a) a-c=-—2,a-d=-3,a-e=0. 
Thus only e is perpendicular to a. 
(b) First, 

a+ 2b =j+ 9k. 
Now a suitable vector along the line joining the origin 
to the point (1,1, 1) isi+j+k. The corresponding unit 
vector is u= ali +j+k). The component of a+ 2b 
in the direction of this line is 

u+(a+2b) = (i+ j+k)- (+ 9k) 


Sf § 


T 


4.8 The angle between i and v is %, and that be- 
tween j and v is } — § = %. Also, |v|=1.5. So the 
i-component of v is 


_—3y v3 _ 3v3 
lvicos§ =5 X= 5E, 


and the j-component of v is 


m—3y1—3 
. oo a 
The angle between i and w is 5 + § = 7B, and that be- 
tween j and w is t— | = or, Also, |w| = 2. Moreover, 


using the formulae from the Handbook, 


|v| cos 


cos 2% = cos(m — 3) 
x 


= COs 7 COS 3 +sin 7 sin 3 


=— Ls 
= COS 3 
1 


— B 


and 


cos 5 = cos(m — 2) 
n 


= COS 7 COS § + sin 7 sin ; 


—_ us 
= — COS § 
= uwe 
=-¥. 
The i-component of w is therefore 
2m _ 1 
|w|cos = = —-2x 5 =—1, 


and the j-component of w is 
|w| cos 7 = —2 x v3 — v3. 
In summary, 


v= 838143) and w= -i-V3j. 


4.9 The technique here is the same for all the vectors. 
One must find the angle between the vector in question 
and the unit vectors i and j. 


Vector a points vertically downwards, in the direc- 
tion —j. Hence i and a are perpendicular, and 


a=0i-j=-j. 


The angle between i and b is a, and the angle between 


j and b is | — a. Hence the i-component of b is 
|b] cosa = 1.5 cosa, 

and the j-component of b is 
|b| cos(F — a) = 1.5sina, 

where we have used the formula 
cos(3 — a) = cos Pcosa + sin Bsin a 

to evaluate cos(4 — a) (see the Handbook). So 
b=1.5cosai+1.5sinaj. 


The angle between i and c is 5 + a, and the angle be- 
tween j and c is a. 


Ya 


av 


Therefore the i-ccomponent of c is 
|c| cos(F + a) = —1.5sina, 
where we have used the formula 
cos(3+ a) = cos 3cos a — sin Bsina 
to evaluate cos($ + a) (see the Handbook). 
The j-component of c is 
|c| cosa = 1.5 cosa, 
so 
c= —-1.5sinai+ 1.5cosaj. 


Finally, the angle between i and d is 7— a, and the 
angle between j and d is 5 + a. 


Ya 


sv 


Thus the i-component of d is 


|d| cos(a — a) = —2cosa, 
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and the j-component of d is 
|d| cos( + a) = —2sina 
(using the usual trigonometric formulae). So 


d = —2cosai-—2sinaj. 


4.10 As in the previous exercise, we must find the 
angles between a, b, c, d and the unit vectors i and j. 
First notice that b points in the direction i, so b = 1.5i. 
Similarly, d points in the direction —i, so d = —2i. 
Also, ¢ points in the direction j, so c = 1.5j. 


YA 


N\A 


The remaining vector, a, makes an angle 5 + a with i, 
and an angle 7 — a with j. Hence the i-ccomponent of a 
is 


Vv 


jal cos(F + a) = —sina, 
and the j-component of a is 

|a| cos(7 — a) = — cosa. 
Therefore 


a= —sinai-—cosaj. 


4.11 Here the angle between i and p is m— a, and 


the angle between j and p is | —a. Therefore the i- 
component of p is 
|p| cos(a — a) = —2.5 cosa, 
and the j-component of p is 
|p| cos(F — a) = 2.5sina. 
Thus 
p= —2.5cosai+2.5sinaj. 


The angle between i and q is 7 — (, and the angle be- 
tween j and q is $ +. Therefore the i-component of 


q is 
|q| cos( — 9) = —3 cos 8, 
and the j-component of q is 
|q| cos(F + 8) = —3sin £. 
Hence 
q = —3cos Zi -— 3sin Bj. 
Finally, the angle between i and r is y, and the angle 
between j and r is 5 +. Thus the i-component of r is 
|r| cos y = 2.5 cosy, 
and the j-component of r is 
|r| cos($ + y) = —2.5sin 7. 
So 
r=2.5cosyi—2.5sin yj. 
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4.12 The i-component of a is clearly zero, while the 
j-component is simply |a|. Similarly, the i:component 
of b is —|b| while the j-component is zero. Hence 
a=|a|j and b=-—\bji. 
The i- and j-components of ¢ are, respectively, 
|c| cos $ = 2 x 4g = V2 
and 


|c| cos(Z + 7) = —2sin = —V2, 


so 
c= V2i- V2j. 
Given that a+b+c=0, the sum of all the i 


components of a+ b-+c must be zero, and so must the 
sum of all the j-components. Therefore (i-components) 


0—|bl+V2=0 
and (j-components) 
ja| +0-—/2=0. 
Thus we see that |a| = |b| = V2. 


4.13 For the sake of clarity, here is a diagram showing 
u, v and w (where all three vectors start at O) drawn 
in the (x, y)-plane. (The z-axis points out of the page.) 


YA 


O 


The cross products are all perpendicular to the (x, y)- 
plane. 
A unit vector in the direction of u X v is k, so 

u X v = (|u| |v|sin Z)k = (2 x 3 x §$)k = 3k. 
The angle between u and w is zero, so 

u X w = (\u||wi|sin0)¢ = (2x 4x 0)¢ = 0c = 0. 
A unit vector in the direction of v x w is —k, so 

Vv X w= (|v| |w|sin 2) (—k) = (3 x 4 x $)(—k) 

= —6k. 
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4.14 (a) i, j and k are unit vectors forming a right- 
handed system. 


Thus, using the definition of the cross product, 
ix j = (lil llsin 3k =k. 
Similarly, 
jxk=i and kxi=j. 
(b) Since (a x b) = —(b X a) for any vectors a and b, 
we have 
jxi=-k, kxj=-i and ixk=-j. 
(c) Since a x a= 0 for any vector a, we have 
ixi=jxj=kxk=0. 
(d) (i +k) x (G+j4+k) 
= (ix (i+ j+k))+(kx (i+j+k)) 
=(0+-k+(-j) + G+ (1) +8) 
=—-i+k, 
(x +1) - (+3) xo 
= (0+ (-j))—(i+i) 
= -i. 


4.15 To compute a x b, we use Sarrus’s Rule: 


i j k i j 
o 3 1 - 2s 
= 2 4 -1 2 

Se = 8 -12%i -j 4k 


soa X b=—14i—9j+k. 
Similarly for a x ¢: 

i j k i j 

2 -3 1 2 -3 

—4 6 -—-2 —-4 6 
—-12k -6i 4j 6i —4j 12k 

soaxc=0O. 
Finally, for b x ec: 


i j k i j 

= 2 a 2 

-4 @ —-9 <4 6 
a) re, =4i 16] —6k 


so b X c = —28i — 18j + 2k. 


Since a X c = 0, and neither vector is zero, the vectors 
a and ¢ are parallel. In fact, c = —2a. 


4.16 A vector perpendicular to a and b is a x b, which 
we can compute using Sarrus’s Rule: 


i j k i j 
2 2 1 2 2 
4 i ae 4 4 
—8k —-4i 14j -14i 4j 8k 


soaxX b= —18i+ 18}. 
We are asked for a unit vector, so the obvious choice is 
1 —18i+ 18j 
__(axb) = isbn aed 
la x b| 18/2 
= 73(-i+J)- 
(Note that wii — j) is also a unit vector perpendicular 


to a and b. This can be obtained by considering b x a 
rather than a X b.) 


4.17 (a) No, it is a scalar. 


(b) Yes, if the angle between a and b is between 5 and 
m radians. 

(c) If either a=0O or b=0O, then indeed a+b = 
|a||b| (= 0). So assume that a and b are both non- 
zero. If a+ b = |a| |b| cos @ = |a| |b], then cos @ = 1, so 
? = 0, i.e. a and b are in the same direction. 

(d) Ifa-b=0, then either a= 0 or b = 0 (or both) 
or a and b are perpendicular. 


(e) Ifa x b =O, then either a = 0 or b = 0 (or both), 
or a and b are parallel (but may have opposite senses). 


d 


4.18 (a) t is perpendicular to both r and s, and its 
sense is vertically down, i.e. into the ground. 


(b) Conversely, the sense of s X r is vertically up. 


(c) t Xr is perpendicular to t (and thus in the hori- 
zontal plane) and perpendicular to r, and by the screw 
rule its sense is due east. 

= in= =~ 
(d) |t|=|r||s|sin | = Js 
(e) tx (rxs)=txt=0 


= nm — i 
(f) r+s = |r||s|cos t= Js 


(g) s-(t Xr) =|s||t x rjcos} (by part (c)) 
= |s| (|t| |r| sin 3) cos 4 
1 


+~xixixte=i 


= saa fa = 2 


4.19 (ax b)-a=0 for any non-zero vectors a and b, 
because a X b is perpendicular to a, and the dot prod- 
uct of perpendicular vectors is zero. 


(If a and/or b is the zero vector, then the answer is still 
ZeYO.) 


Solutions to the exercises 
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absolute error 70, 80 

absolute error bound 70 
absolute value 9 

accumulation 64 

addition of vectors 165, 170, 175 
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analytic solution 83 

angle between vectors 180 
approximate solution 75 

arbitrary constant 41, 67, 94 
arccos 25 
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arctan 25 
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Argand diagram 30 

argument of a complex number 30 
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asymptote 39 
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birth rate 63 

boundary condition 138 
boundary value 138 
boundary-value problem 138 
bounded above 39 

bounded below 39 


Cartesian components of a vector 169, 174 


Cartesian coordinates 159, 164, 172 
Cartesian unit vectors 164, 174 
Chain Rule 36 

characteristic equation 114 

closed form of a recurrence system 10 
closed interval 8 

codomain of a function 25 
coincident roots 28 

column vector 169 

commutativity 166 

complementary function 125 
complex conjugate 28 

complex exponential 31 

complex number 27 

complex-valued function 37 
component form of a vector 169, 174 
component form of cross product 186 
component form of dot product 180 
component of a vector 169, 174, 181 
composite function 21 

Composite Rule 36 

composition of functions 21 

constant of integration 41 
constant-coefficient equation 112 
continuous function 40 

continuous model 10 

cosecant 23 

cosine 23 
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cost 81 
cotangent 23 
cross product 184, 185 


De Moivre’s Theorem 31 
death rate 63 
decay constant 89 
decimal places 7 
decreasing function 38 
definite integral 48 
dependent variable 10, 65, 85 
derivative 32, 65 
derived function 32 
difference of two squares 16 
differentiable 66 
differential equation 66 
explicit solution 87 
first-order 66 
general solution 67 
homogeneous 91 
implicit solution 87 
inhomogeneous 91 
linear 91 
non-homogeneous 91 
order 66 
particular solution 67 
solution 66 
differentiation 31 
of a complex-valued function 37 
direct integration 84 
direction field 72 
direction of a vector 155, 157 
discrete model 10 
discriminant 15, 120 
displacement 154, 156 
displacement vector 156 
distributivity 166 
division of complex numbers 28 
domain 69 
domain of a function 10, 44 
dominate 144 
dot product 177, 178 


efficiency 81 

equal roots 28 

equal vectors 158 

equation of a straight line 176 
error 70 

error bound 9 

Euler’s formula 31 

Euler’s method 78 

explicit solution 87 

exponent of an exponential 17 
exponential form of a complex number 
exponential function 17, 18, 129 


factorization 15 
first derivative 33 
first-order differential equation 66 
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formula method for a quadratic equation 14 
function 10 
function notation for derivatives 33 


Gaussian elimination 13 

general solution 110, 120, 121, 125, 126 
general solution of a differential equation 67 
global maximum 39 

global minimum 39 

gradient 12, 32, 72 


half-open interval 8 
homogeneous differential equation 91 
homogeneous equation 112 


image set of a function 10 
imaginary part of a complex number 27 
implicit differentiation 37 
implicit solution 87 
increasing function 38 
indefinite integral 41, 84 
independent variable 10, 65, 85 
index of an exponential 17 
inhomogeneous differential equation 91 
inhomogeneous equation 112, 124 
initial condition 68, 136 
initial value 68, 136 
initial-value problem 68, 136 
input 64 
input-output principle 64 
integer 7 
integral 42, 84 
integrand 41 
integrating factor 94 
integrating factor method 95 
integration 41 

by parts 47 

by substitution 45 
interval 8 
inverse function 18 
inverse trigonometric functions 25 
irrational number 7 


Leibniz notation 33 

linear constant-coefficient second-order differential 
equation 112 

linear differential equation 91 
linear function 12 

local maximum 38 

local minimum 38 

log plot 19 

logarithm function 18 
logistic equation 65, 72, 90 
log-linear plot 19 

log-log plot 20 

lower bound 39 


magnitude of a number 9 
magnitude of a real number 155 
magnitude of a scalar 155 
magnitude of a vector 155, 170, 175 
maximum 39 


method of undetermined coefficients 127, 131 
minimum 39 

modulus of a complex number 29 

modulus of anumber 9 

modulus of areal number 155 

modulus of a vector 155 

multiplication of a vector by a scalar 162 
multiplication of complex numbers 30 


natural logarithm function 18 

Newtonian notation 33 
non-homogeneous differential equation 91 
non-homogeneous equation 112 

nth derivative 33 

nth root 17 

nth-order polynomial 28 


open interval 8 

order of a derivative 33 

order of a differential equation 66 
orientation of a vector 157 
output 64 


parallel vectors 162 
parallelepiped 188 
parallelogram rule 166 
parameter 11 

particular integral 125, 131 
particular solution 111, 135 


particular solution of a differential equation 67 


perfect square 16 

periodic function 23 
perpendicular vectors 178 

plane polar coordinates 159 

polar coordinates 29, 159 

polar form of a complex number 30 
polynomial function 128 
polynomial of degree n 28 
population model 63 

position vector 168, 174 

power function 20 

power of an exponential 17 
principal value of the argument 30 
principle of superposition 113 
Product Rule 35 

proportionate birth rate 63 
proportionate death rate 63 
proportionate growth rate 64 


quadratic equation 14 
quadratic function 14 
Quotient Rule 35 


rate of change 32 

rational number 7 

real number 7, 155 

real part of a complex number 27 
recurrence system 10 

reducing the step size 79 

relative error 71 

repeated roots 28 
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resolving a vector into components 171, 181 solving inhomogeneous equations 126 
resultant 165 speed 155 
right-hand rule 173 stationary point 38 
right-handed system 173 steady-state solution 143 
roots step length 77 
of a polynomial equation 28 step size 77, 79 
of a quadratic equation 14 subtraction of vectors 165 
sum of 14 sum of vectors 165 
rounding 8 
rounding error 81 tangent function 23 


third derivative 33 
transient 142 

transient solution 142 
transpose symbol 169 
trial solution 127 
triangle rule 165 
trigonometric functions 22 
trigonometric identities 26 


Sarrus’s Rule 186 

scalar 155 

scalar multiple 162 

scalar multiplication 162 
scalar product 177 

scalar triple product 188 
scaling 162, 166 

scaling of a vector 162, 170, 175 


scientific notation 7 undetermined coefficients, method of 127, 131 
screw rule 173 unit vector 163, 174 

secant 23 upper bound 39 

second derivative 33 

sense of a vector 157 variable 10 

separable differential equation 87 vector 155, 167 

separation of variables method 87 vector addition 165, 170, 175 

sign of areal number 155 vector addition rule 165 

significant figures 7 vector equation of a straight line 176 
simultaneous linear equations 13 vector product 184 

sine 23 vector subtraction 165 

sinusoidal function 130 velocity 155, 157 

slope 12, 32, 72, 75 volume of a parallelepiped 188 
smooth function 40 

solution of a differential equation 66 (a,y)-plane 172 

solution of a quadratic equation 14 

solving homogeneous equations 120 zero vector 157, 165, 166 
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